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Pay people to fight malaria 


Workers on the front line of the battle against the disease are too often overlooked as scientists and 


funders plot how to defeat malaria. 


ature sent a reporter to Myanmar, Cambodia and Thai- 
| \ land in May to see how researchers are helping countries in 
the Greater Mekong region reach their goal of eliminating 
malaria by 2030. Her News Feature on page 458 highlights how politi- 
cal, financial and geographical obstacles allow the disease to persist. 
The broader malaria community, including researchers and funders, 
must do more to address these challenges. It is a race against time. 
The disease must be stamped out in the region now, while it remains 
relatively rare. Allow it to increase, and the chance will be lost. Worse, 
drug-resistant parasites there will multiply and spread. 

It’s no secret that the best way to beat malaria is to ensure that 
every infected person can quickly reach a health worker and be given 
artemisinin-based combination therapies (ACTs) — pills that swiftly 
kill the parasites that cause malaria before they spread to other people. 
The importance of health workers is highlighted by what happens in 
their absence. Hundreds of thousands of African children die every 
year because they do not get the pills in time. And in Venezuela — once 
declared malaria-free — the disease has come back with a vengeance 
as political instability has fractured health services. 

A review of 75 malaria resurgences around the world from 1930 to 
2011 found that 90% were caused by lapses in basic malaria-control 
measures — mosquito control (mainly through insecticides) and 
health systems (J. M. Cohen et al. Malaria J. 11, 122; 2012). Insecticides 
are potent tools for reducing malaria, but resistance is on the rise. Bed 
nets also reduce risk, but can’t always prevent the disease. A strong 
public-health system is necessary because there is no powerful vaccine 
for malaria, unlike other illnesses subject to elimination campaigns, 
such as polio. 

Yet health systems in many malaria-endemic nations are riddled 
with holes. Malaria health workers are often unsalaried, so many leave 
for other jobs. Networks of researchers, technicians and administra- 
tors support health workers, provide surveillance and inform policy. 
But health budgets in low-income nations are often too small to attract 
enough of these skilled workers. 

At present, too many of these crucial positions are unpaid. The Bill 
& Melinda Gates Foundation in Seattle, Washington, the US govern- 
ment and many other donors do not pay salaries for health workers 
and other staff in national health systems. The Global Fund to Fight 
AIDS, Tuberculosis and Malaria is a rare exception in supplementing 
pay for national health workers — but even it does not allocate funds 
to researchers and officials at higher levels in ministries of health. It 
says that national governments should do this, but many can’t or don't. 

The problem is complex, but there are a few ways in which the sci- 
entific community could help. Researchers can analyse the costs and 
value of health-system components and compare them with those 
of other projects, such as genetically modifying mosquitoes to wipe 
them out, or conducting vaccine studies. This would help donors 
and governments re-evaluate their priorities when funds are limited. 
Researchers can also help by developing simple tools and treatments 


for other diseases, thus allowing local malaria health workers to 
be more versatile. In this way, their services remain crucial even as 
malaria infections drop — as they have across southeast Asia. This 
added value means that governments might be more willing to pay 

workers. 
Furthermore, top-tier scientists in Western countries can work to 
boost the status of their colleagues in malaria-endemic regions. After 
all, local researchers working in government 


“The best way agencies are best placed to lobby their lead- 
to beat malaria ers for bigger health budgets. At the moment, 
istoensurethat these local researchers often provide a sup- 
every infected porting role to those from the United States 
personcan and Europe who are the lead investigators 
quickly reacha on large grants. The international scientific 


community should put more emphasis on 
providing long-term mentorship, building 
laboratories and helping colleagues to boost their careers. 

And — despite strong disincentives for doing so — experts should 
push back against funders and journals that favour ‘outside the box’ 
ideas over routine support for health workers in remote villages. For 
example, drones have been touted as a new means of delivering malaria 
drugs to remote areas. Researchers should ask for evidence that such 
expenditures are more valuable than improving the basic health system. 

Last month, the World Health Organization declared Paraguay to 
be malaria-free, even as cases were rising in Venezuela, Panama, Peru 
and Nicaragua. Public-health officials attributed the nation’s victory 
to its health system’s ability to quickly detect, treat and track cases. 
Sometimes, the classic recipes are the best. m 


health worker.” 


@ e@ 
Inspiring women 
Awards highlight impressive efforts that reach 
the next generation. 


excellent early-career female scientists, and the other acknowledges 
individuals or organizations that have led grass-roots initiatives to 
support increased access to, or interest in, science, technology, engi- 
neering and mathematics (STEM) for girls and young women around 
the world. This week, we announce the candidates who have made the 
longlist of ten for each Nature Research award, and congratulate all 
of them. The impressive breadth and quality of all the entries deserve 
wide recognition, too. 
There are many reasons for which women are discouraged from 
entering STEM professions or from staying in them. They include 


I: April, Nature announced two annual prizes: one recognizes 
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unequal access to education; conscious and unconscious discrimina- 
tion in promotion, funding, peer review, citation and more; unequal 
pay; and, often, a disproportionate burden of domestic responsibili- 
ties. All of that can help to explain why women still make up only 
around 30% of professional researchers. It is encouraging to see the 
many entries from women who are successful researchers and who 
are also determined to help others and to inspire the next generation. 
One route towards equality in science is to offer positive role models 
to young people. 

Some of the most impressive prize entrants focus on countries and 
locations in the global south, working with particularly impoverished 
and underprivileged communities. This often comes with additional 
challenges — sometimes, just reaching remote locations to mentor 
girls and encourage them to pursue science requires serious commit- 
ment. For example, one project, Cielo y Tierra, organizes outreach 
efforts in rural South America and Africa. 

Obviously, not all of the girls and young women reached in this way 
will end up pursuing science in their studies or careers. But awaken- 
ing their curiosity and nurturing their ambitions is still worthwhile. 

It turns out that mentorship and support over long distances can 
be highly effective. One scheme has successfully matched research- 
ers with young girls from low-income communities as penfriends. 
Working with US schools and with scientists around the world, 
‘Letters to a Pre-Scientist’ fosters a love for science, offers support 
and provides personal experiences. Scientists are encouraged to 
write at least once every few months, and to give instances of how 
science has, for example, taught them to persevere or to learn from 


failure. It’s a simple and effective way to demystify science by creating 
personal connections. Perhaps you should try it. 

For students already in the university environment, gaining access 
to female scientists might still be difficult. Therefore, many award 
applicants have set up visiting female researchers’ programmes, 
lecture series focused on female speakers, or social events that enable 
networking with prominent female researchers. 

In the global north, many of the efforts 


“One route focus on girls and young women from minor- 
towards equality _ ity ethnic groups. Others target communities 
in science is to of refugees, the children of female prison 
offer positive inmates and those who live in orphanages. 

role models to Many of these projects, the prize entrants 


told us, are driven by the entrants’ own expe- 
riences as women in science. For example, 
some scientists who are also mothers are working to improve childcare 
provision at scientific conferences. Others have overcome particularly 
strong local cultural constraints, under which women are rarely seen 
in public, let alone in science and education. 

All of the entrants believe that women should have the opportunity, 
agency and support to become researchers. We agree. (And we appre- 
ciate that gender is neither binary nor necessarily fixed.) Together, 
these examples show the enormous range of ways to inspire girls and 
young women, and they offer lessons for all. The full longlists for the 
two prizes — the Inspiring Science Award and the Innovating Science 
Award — are available at go.nature.com/2jmri9z. The winners will be 
announced in October. m 


young people.” 


Publish not perish 


Conservationists have a new tool for weighing 
up the risks and benefits of open data. 


night parrot was forbidden. Long feared extinct, when the bird 

(Pezoporus occidentalis) was rediscovered, officials in Australia 
decided to restrict both location data and recordings of its signature 
call for fear that poachers and enthusiasts might use the information to 
track and disturb the creatures. Yet when the recordings were declas- 
sified and shared last year, conservationists were delighted by what 
followed: at least three new populations have since been discovered 
by people using the call to recognize the birds. 

Since the bird’s rediscovery in 2013, the Australian government has 
put in place proper conservation safeguards, such as making it illegal 
to approach the creature’s habitat. It’s a good example of authorities 
weighing up the risks and benefits of publicizing the discovery of a 
rare species and then reaching a sensible compromise. That kind of 
decision process is increasingly in demand, as data sources and shar- 
ing proliferate beyond conventional academic audiences in ways that 
risk, for example, helping hunters and illegal wildlife traders to track 
down target species. 

Ina Perspective published this week in Nature Ecology & Evolution 
(A.1.T. Tulloch et al. Nature Ecol. Evol. 2, 1209-1217; 2018), conserva- 
tion experts offer a way to help scientists and officials to decide when 
to publish such sensitive information — and when not to. It’s the latest 
development in an ongoing debate that pits advocates of open data 
against those who take a harder line and want more restrictions. The 
authors warn that a default position in which location data are with- 
held if a species is identified as being of high biological significance 
and under high threat — as recommended by the Global Biodiversity 
Information Facility — risks missing out on the benefits of data sharing. 

To aim for a more balanced approach, the scientists drew up a deci- 
sion tree to help people judge what to do with information gained from 


Fee 2013 to 2017, listening to the gentle ‘ding ding’ of the 
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wildlife monitoring and surveys. A series of steps asks questions such 
as “Could data be used to mitigate threats to species?” and “Would 
sharing location data increase risk of species decline through increased 
visitation?” In some cases — fish spawning locations for one, because 
the fishing industry would love to target them — the recommendation 
is to keep everything from the name ofa species to its location under 
wraps. But in other cases, the need for secrecy is trumped by the pos- 
sible benefits of transparency. Open data could help local communities 
fight to protect a habitat when development is threatening a species. 

Ayesha Tulloch, an environmental scientist at the University of 
Sydney in Australia who led the analysis, says her team was surprised 
by the low number of examples they tried that produced a ‘don't pub- 
lish’ decision. That, she adds, could help scientists to get funding for 
projects that might otherwise be rejected because of the restrictions 
expected to be placed on the eventual results. Several government 
departments in Australia and New Zealand — including those who 
have historically kept data from the public — have already said that 
they are interested in using the decision framework to help set policy. 

Such discussions are timely. Scientists are poised to launch a 
satellite-tracking project that will massively increase the number of 
species worldwide whose locations can be traced in near-real time. 
Earlier this year, a satellite antenna called ICARUS — International 
Cooperation for Animal Research Using Space — was attached to 
the International Space Station. Combined with lightweight animal 
tags, it will allow researchers to follow the long-range movements of 
much smaller species than has previously been possible. For some 
species, that information could place them at greater risk. There are 
already concerns that unethical safari guides and hunters have hacked 
into the biotagging systems used to keep tabs on endangered animals 
such as lions (S. J. Cooke et al. Conserv. Biol. 31, 1205-1207; 2017). 

About one in six of the threatened species listed by the International 
Union for Conservation of Nature are classed as data-deficient, which 
means conservationists and scientists face a struggle to perform basic 
analyses such as population modelling and to work out which habitats 
must be preserved. Data bring power. But they also bring responsibil- 
ity. This new decision-making tool could help to generate the former 
while respecting the latter. m 
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forms arXiv and bioRxiv, which make manuscripts available 

before they have been peer reviewed and accepted by a journal. 
Scientists applaud preprints because they enable researchers to claim 
priority and make their findings available more quickly, unshackled 
from sluggish and tyrannical journals. 

This might make sense within the scientific community, but as 
someone who has worked for years with researchers and journalists 
to ensure responsible coverage of science in the media, I fear that this 
method of publication holds substantial risks for the broader com- 
munity — risks that are not being given proper consideration by the 
champions of preprint. Weak work that hasn’t been reviewed could 
get overblown in the media. Conversely, better work could be ignored. 

Those who run preprint servers, researchers who submit to them 
and anyone considering publicizing a preprint 
have a responsibility to take note of these risks. 
The responses we received to an open letter we 
posted on 17 July (see go.nature.com/2uxf86x) 
have not entirely reassured me that this is the 
case. Here, I call on scientists to reflecton — and 
find ways to limit — the potential of preprints 
to elicit public confusion and misinformation. 

Many people still learn about science the same 
way they learn about Vladimir Putin, Syria or the 
World Cup: through news sites, television and 
radio. The bulk of research reported through 
these channels is peer reviewed. A few days before 
a paper is published, the science journal will issue 
a restricted press release to qualified journalists under an agreement 
(called an embargo) that no one will report on the paper until a desig- 
nated time. The embargo system has its flaws, but it does give reporters 
time to assess the research and gather expert reaction. 

Contrast this with preprints. As soon as research is in the public 
domain, there is nothing to stop a journalist writing about it, and rush- 
ing to be the first to do so. Imagine early findings that seem to show 
that climate change is natural or that a common vaccine is unsafe. 
Preprints on subjects such as those could, if they become a story that 
goes viral, end up misleading millions, whether or not that was the 
intention of the authors. 

Many reporters (including those at Nature) have written stories 
based on preprints. I'll admit that we do not yet have examples of harm 
from such stories, but this is probably because — at the moment — only 
atiny fraction of preprints cover health-related or controversial fields. 

What we have no shortage of are examples in which public under- 
standing has been distorted by media coverage of ambiguous or just 
downright bad science. Take the (now retracted) paper which claimed 
that genetically modified maize (corn) gave rats cancer (G.-E. Séra- 
lini et al. Food Chem. Toxicol. 50, 4221-4231; 2012) — its carefully 
orchestrated publicity campaign ensured that journalists could not 
seek outside scrutiny before running their news stories. The science 


ik: of papers are submitted every month to the plat- 


HOW CAN 
WE HAVE 


PREPRINTS 


AND SUPPORT 


GOOD 
JOURNALISM? 


Preprints could promote 
confusion and distortion 


The scientific community must take measures to keep preprints from 
distorting the public’s understanding of science, says Tom Sheldon. 


was poor, but the claims were intriguing. Many journalists gave up the 
chance to vet information so as not to be late to the story. 

Some scientists have pointed out that preprints are similar to 
conference presentations, which are also not peer reviewed. In my expe- 
rience, conference proceedings often make the most misleading stories 
of all. Take a poster from the Pediatric Academic Societies meeting in 
April-May 2016, which looked at the incidence of autism and pesticide 
application in New York state. This was extremely early work that could 
not definitively establish a pattern, let alone a causal link. Still, several 
mainstream outlets rushed to cover it with some alarming headlines, 
infuriating scientists. Preprints could create similar pressures, and simi- 
larly flawed coverage — but much more often, and ona larger scale. 

Another risk is the inverse — and this one could matter more to 
some researchers. Under the preprint system, one intrepid journal- 
ist trawling the servers can break a story; by the 
time other reporters have noticed, it’s old news, 
and they can’t persuade their editors to publish. 

There have been cases in which a preprint 
that garnered news stories got a second flurry 
of coverage when it was published in a journal. 
But generally, the rule is ‘it has to be new to be 
news. Reacting to our blog, Tom Whipple, sci- 
ence editor of The Times in the United Kingdom, 
tweeted: “I'm not sure how to keep a newspaper 
in profitable existence that decides to give people 
news they've already read on the BBC” 

It is not enough to shrug and blame journalists, 
and itis unhelpful to dismiss those journalists who 
can accurately convey complex science to a mass audience. Scientists 
need to be part of these debates — with their eyes open to how the 
media works. Journalists do include appropriate caveats or even decide 
not to run a story when conclusions are tentative, but that happens only 
because they have been given enough time and breathing space to assess 
it. If the scientific community isn’t careful, preprints could take that 
resource away. 

How can we have preprints and support good journalism? Should 
scientific societies or preprint advocates develop guidelines for what 
should and should not be posted as a preprint? Should there be a pre- 
print moratorium on any research with public-health implications? 
Should universities or researchers ever publicize a preprint? Should 
all preprints be emblazoned with a warning aimed at journalists that 
work has not been peer reviewed? 

Preprints could bring great prizes for science. But these questions 
must be brought up now, so that public understanding is not damaged 
as preprints flourish. = 


Tom Sheldon is a senior press manager at the Science Media Centre 
(SMC) in London. Funders of the SMC include Springer Nature, 
Nature’ publisher. 

e-mail: tom@sciencemediacentre.org 
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Quark finder 

Physicist Burton Richter, who 
shared the 1976 Nobel Prize 
in Physics, died on 18 July, 
aged 87. In the 1960s, Richter 
helped to design a ring-shaped 
particle collider at the Stanford 
Linear Accelerator Center 
(SLAC) in California, which 
he used to discover a new 
subatomic particle in 1974. 
This particle, later named 

J/y, proved the existence of 

a fourth quark (known as 
charm) and garnered Richter 
the Nobel, which he shared 
with physicist Samuel Ting. 
Richter was the director of 
SLAC from 1984 to 1999, 
during which time he oversaw 
numerous developments in 
particle accelerators. 


Three new ages 


The International Union of 
Geological Sciences (IUGS) 
has declared that we are living 
ina geological time period 
called the Meghalayan age. 
The period covers the most 
recent 4,200 years of Earth’s 
4.6-billion-year history and 
is one of three ages newly 
named by the organization. 
Each is a climatically distinct 
period of the current 
Holocene epoch, which began 
at the end of the last ice age, 
about 11,700 years ago. The 
Meghalayan age started with 
a 200-year global drought 
recorded in sediment around 
the world; the official marker 
isa stalagmite in a cave in 
northeastern India. The 
preceding Northgrippian age 
was a relatively cold stage 
that began about 8,300 years 
ago. It followed the warmer 
Greenlandian age, which 
marked the start of the 
Holocene. Some scientists 
argue that the Holocene has 
given way to a new epoch 
called the Anthropocene, 
marked by human 


Fruit-fly brain mapped in epic detail 


Scientists have produced a 3D image of the 
brain of a fruit fly (Drosophila melanogaster) 
that is so detailed, researchers can trace 
connections between neurons across the 
entire organ. The data, published on 19 July 
(Z. Zheng et al. Cell http://doi.org/gdvjwe; 
2018), also revealed some neurons that have 
never been seen before. The scientists cut 

a fly’s brain — roughly the size of a poppy 


interference with the planet. 


But researchers disagree on 
when this proposed period 
would begin, and geological 
evidence for a new epoch is 
inconclusive. 


Antibiotic worries 


Many low-income countries 
lag behind in taking action 
to prevent bacteria from 
becoming resistant to 
antibiotics, the World 
Health Organization and 
two other United Nations 
agencies said in a report on 
18 July. Such steps include 
improving sanitation and 
hygiene. About one-quarter 
of health-care facilities in 
Africa still lack water, the 
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behaviour. 


report says. And just over 
half of the world’s population 
lives in countries that do not 
have a national system for 
monitoring the consumption 
of antibiotics. Roughly half 
of low-income countries do 
not require prescriptions for 
antimicrobials. Only 93 of 
194 member states reported 
having developed strategies 
to stall resistance, the review 


finds. 


Weapons opposition 
More than 2,600 people and 
195 organizations have signed 
a pledge not to support the 
development, manufacture, 
trade or use of lethal 
autonomous weapons. The 
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seed — into more than 7,000 slices and shot 

a beam of electrons through them. A high- 
speed camera captured detailed pictures of 
each slice, generating about 21 million images 
that the team then combined using custom 
computer software. The resulting picture 
offers scientists a new tool with which to 
study the network of neurons behind fruit-fly 


signatories include SpaceX 
chief executive Elon Musk, 
three co-founders of Google 
DeepMind and thousands 
of researchers in the fields of 
computer science, robotics 
and artificial intelligence. In 
an open letter posted online 
on 18 July, they argue that 
“the decision to take a human 
life should never be delegated 
toa machine”. The pledge 
was initiated by the Future 
of Life Institute, a research 
and outreach organization in 
Cambridge, Massachusetts, 
which focuses on the 
existential threats posed by 
technology. In August 2017, 
more than 100 specialists 
working in artificial 


VISUALISATION: PHILIPP SCHLEGEL, DROSOPHILA CONNECTOMICS GROUP, CAMBRIDGE 


DR. SEUSS ENTERPRISES 


SOURCE: UK HOME OFFICE 


intelligence and robotics called 
on the United Nations to ban 
autonomous weapons. 


Sharing results 


A platform that allows 
researchers to share 
anonymized clinical-trial 
data, and that already boasts 
results from more than 

2,500 trials involving roughly 
1.3 million participants, 
launched on 19 July. The 
platform, developed by the 
non-profit company Vivli of 
Cambridge, Massachusetts, 
will also provide tools for 
analysing data across different 
databases, and aims to capture 
individual patient data from 
academic, industry and non- 
profit sponsors of clinical 
trials. More than 2,000 of the 
trials currently in the database 
were provided by London- 
based pharmaceutical giant 
GlaxoSmithKline. 


The Lorax’s origin 
The title character in the 
famed children’s book 

The Lorax may have been 
inspired by the patas monkey 
(Erythrocebus patas), a species 
found across West and East 
Africa, researchers reported 
on 23 July (N. J. Dominy et al. 
Nature Ecol. Evol. 2, 1196- 
1198; 2018). Theodor Seuss 
Geisel — better known by his 
pen name, Dr Seuss — wrote 
much of the book during a 
visit to Kenya in 1970. The 


TREND WATCH 


The production of genetically 
modified animals has levelled off 
in UK laboratories, after a decade 
that saw a 37% rise in the number 
bred. Last year, 1.9 million of these 
animals were produced, about the 
same as the year before, according 
to annual statistics released by 

the UK government on 19 July. In 
addition, 1.9 million experimental 
procedures, including those used 
for basic research, drug studies 
and surgical training, were 
conducted on animals last year — 
a7% drop since 2016 anda 17% 
decline since 2008. 


patas monkey is native to the 
country, and a morphometric 
analysis described in the 
study has revealed similarities 
between the faces of the patas 
monkey and the Lorax. 


Park pollution 


The air in US national 

parks contains just as much 
ozone pollution as the air 

in many of the country’s 
largest cities, according to a 
study published on 18 July 
(D. Keiser et al. Sci. Adv. 4, 
eaat1613; 2018). Researchers 
compared ozone levels in 

33 national parks and in 

20 of the largest US cities 
between 1990 and 2014. After 
controlling for weather and 
the season, the team found 
that pollution levels in the 
parks and cities were similar. 
In fact, before 2000, summer 
ozone levels in the parks 
increased before they started 
to drop. The study authors 


RESEARCH ANIMALS 


arent sure what caused this 
pattern. The findings raise 
health questions for park 
visitors because exposure 

to ozone pollution can 
exacerbate conditions such as 
asthma. 


POLICY 


Species in danger 
The US government 
announced several proposed 
changes to the Endangered 
Species Act, a 45-year-old 
law that protects thousands 
of threatened or endangered 
species. The proposals, 
announced on 19 July, aim to 
alter the protocols used for 
determining a species’ critical 
habitat and to streamline 

the process of listing and 
delisting organisms under 
the law. The US Fish and 
Wildlife Service, together 
with the National Marine 
Fisheries Service, would also 
change a current provision 
that automatically provides 
the same protections to 
threatened and endangered 
species, unless officials 
decide otherwise. The public 
has 60 days to comment on 
the proposals before the 
government finalizes them. 


Science migration 


A UK parliamentary group 
has recommended ways 

in which the country’s 
immigration system for skilled 
workers such as scientists 
should change after Brexit 


The number of animals used in UK labs in 2017 fell by 4% — mostly 
due to the 7% decline in experimental procedures on animals. 


— Total procedures == Experiments on animals 
— Genetically modified (GM) animals produced* 


Number (millions) 


o>. |—nw»¢n no, 
2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 


*Does not include experiments done on GM animals. 
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to accommodate workers 
from the European Union. 
EU citizens currently have 
automatic rights to live and 
work in the United Kingdom, 
but it is not yet clear whether 
they will retain any of those 
rights after the Brexit process 
is complete. A report by the 
House of Commons Science 
and Technology Committee, 
published on 19 July and based 
on discussions with research 
organizations, recommends 
that the system allow skilled 
workers to visit visa-free for 
up to 180 days. Although 

the recommendations were 
designed to address the issue 
of how EU workers will come 
to the United Kingdom after 
Brexit, the same policies could 
also be applied to workers 
from other countries, the 
report says. For long-term 
stays, the committee suggests 
that the government remove 
an annual cap on the “Tier 2’ 
visa for highly skilled workers. 


ENVIRONMENT 
China’s emissions 


As the world’s largest carbon 
emitter, China needs to do 
more if the world is to meet 
its climate-change goals, says 
a 19 July report by a former 
senior US energy official. 
David Sandalow, now at 
Columbia University in New 
York City, found that China’s 
carbon dioxide emissions 
have increased by only 3% or 
less in the past 5 years, even 
as its economy has grown by 
40%. The report argues that 
the country has enormous 
potential to contribute to 
climate-change solutions: 
government policies are 
helping to cut coal use and 
increase use of low-carbon 
alternatives, and the nation 
is investing in technologies 
such as electric vehicles and 
solar power. But some of its 
policies — such as converting 
coal to synthetic natural gas 
and financing coal power 
plants around the world — run 
counter to these goals, the 
report concludes. 
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NEWSIN FOCUS 


Government lures 
scientists home to bolster 
research workforce p.453 


cut off from journals in row 
over open access p.454 


German scientists Planck 


telescope’s final map 
marks end of an era p.455 


The race to defuse 
a ticking time bomb in 
southeast Asia p.458 
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Northern-quoll populations have declined by more than 75% since the introduction of poisonous cane toads, which the quolls mistake for food. 


Evolution experiment aims 
to save Australian marsupial 


If it succeeds in quolls, the conservation strategy could also help protect Tasmanian devils. 


BY APRIL REESE 


nanisland off Australia’s north-central 
() coast, researchers are conducting an 
unprecedented experiment: mixing 
endangered animals that have evolved genetic 
defences against their biggest foe with those 
that haven't, in the hope that their offspring 
will take after the wiser parent. 
The subject of the experiment is one of Aus- 
tralia’s most imperilled marsupials, the northern 
quoll (Dasyurus hallucatus). This squirrel-sized 


carnivore is struggling to survive a decades-long 
onslaught of poisonous and invasive cane toads, 
which quolls mistake for prey, with devastating 
results. The team has tested the match-making 
technique in captive-bred quolls, and reported 
the results last month (E. Kelly and B. L. Phillips 
Conserv. Biol. http://doi.org/csf2; 2018). 

The researchers are now testing the tech- 
nique in wild animals on Indian Island and, if 
the experiment is successful, it could provide 
some of the first real-world evidence that tar- 
geted gene flow — which involves pushing an 
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adaptive trait through an at-risk population 
to boost resiliency — could be used to save an 
endangered species. 

Australia, which has the world’s highest 
mammal extinction rate, is desperate for new 
conservation tools, says Euan Ritchie, a wildlife 
ecologist at Deakin University in Melbourne. 
The quoll research is exciting, he says, because it 
“provides hope we can still achieve conservation 
wins, even for the most challenging and press- 
ing of situations” The real test will be the ongo- 
ing wild trial. If targeted gene flow works, the 
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> research team says it could help other threat- 
ened species, too — including Tasmanian devils 
and corals on the Great Barrier Reef. 

In the 80 years since agriculture officials 
introduced the cane toad (Rhinella marina) 
to northeastern Australia to control a sugar- 
cane-devouring beetle, the amphibians have 
spread across the state of Queensland, the 
Northern Territory and large chunks of West- 
ern Australia. Their rapid advance has devas- 
tated northern-quoll populations, which have 
shrunk by more than 75%. 

Ecologists Ella Kelly and Ben Phillips knew 
from their previous research that some quoll 
populations in Queensland had developed an 
aversion to the toads over the years (E. Kelly 
and B. L. Phillips Behav. Ecol. 28, 854-858; 
2017). The researchers, both at the University 
of Melbourne, wondered whether the trait 
could be successfully bred into vulnerable 
quoll populations that cane toads hadn't yet 
reached. That could make those ‘naive’ quolls 
more resilient to toad invasions, if enough 
animals ina given group have the trait. 

To test this idea, the scientists bred animals 
in captivity, mixing northern quolls from 
a toad-infested area of Queensland that 


displayed an aversion to toads with naive 
quolls from a toad-free island in the North- 
ern Territory. Kelly and Phillips then exposed 
the resulting offspring to a toad leg to gauge 
whether the young quolls recognized the 
threat. They found that most of the young 
quolls wouldn't touch the toad legs. 

The finding suggests that the trait is inher- 
ited, rather than taught by mother quolls, and 
may be dominant, the researchers say. “That's 
the first hurdle that needs to be jumped in 
showing targeted gene flow,’ Phillips says. 
“Without a genetic basis, there is no point in 
introducing [toad-smart quolls] into the popu- 
lation. And we found there is a genetic basis.” 

The captive-quoll study is an important step 
towards demonstrating that targeted gene flow 
is a viable strategy to aid quoll conservation, 
says Sarah Fitzpatrick, a conservation biolo- 
gist at Michigan State University’s Kellogg 
Biological Station in Hickory Corners. “Many 
behaviours are plastic, and therefore are not 
necessarily controlled by certain genes,” she 
says. “If this were the case for toad-eating 
behaviour, targeted gene flow would not work” 

Buoyed by the results of the captive study, 
Kelly and Phillips, along with University of 


Melbourne colleague Chris Jolly, decided 
in May 2017 to see whether toad-smart and 
naive quolls would produce toad-smart off- 
spring in the wild. They released 54 quolls on 
toad-infested Indian Island — a mix of naive 
Northern Territory quolls, toad-averse quolls 
from Queensland and hybrid offspring. 

When the researchers returned in April this 
year to check on the quolls, they found good 
and bad news. Many fewer quolls survived than 
the team had anticipated — just 16 animals, 
according to the researchers’ population esti- 
mate. But encouragingly, the group included 
offspring that seemed to be toad-smart, which 
would suggest that they had inherited the trait 
from their parents. The team is now analysing 
genetic samples taken from the survivors. 

Kelly and Phillips plan to return to the site 
again next April, to see how the remaining 
quolls fare. 

As the experiment continues, the pair is 
seeking permission from wildlife officials 
in Western Australia to introduce northern 
quolls with the toad-smart trait into popula- 
tions in the path of the cane-toad diaspora to 
gird them against future invasions. “It would 
be a tragedy not to try,” Phillips says. m 
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NASA aims for 
Sun’s corona 


The Parker Solar Probe will make humanity’s closest 


approach to its home star. 


BY ALEXANDRA WITZE 


spacecraft that can fly through the Sun’s 
atmosphere without melting. 

On 6 August, if all goes to plan, the US$1.5- 
billion Parker Solar Probe will lift off from a 
launch pad at Florida's Cape Canaveral. Just 
three months later, it will whizz much closer to 
the Sun than any spacecraft has ever come, to 


S= aside, Icarus: NASA has made a 


take the first-ever direct measurements of the 
star’s maelstrom of energy. 

But that’s just the beginning. Over the next 
7 years, the craft will loop around the Sun 
another 23 times, passing nearer and nearer — 
ultimately flying about 6.2 million kilometres 
above the surface, well within the solar corona. 
That's nearly seven times closer than the record 
mark set by the German Helios 2 spacecraft 
in 1976. 


The Parker Solar Probe aims to answer some 
of the biggest outstanding questions about the 
Sun, such as how its corona is heated to mil- 
lions of degrees while the surface beneath it 
stays relatively cool’. The spacecraft will also 
visit the birthplace of the solar wind, a flood 
of energetic particles that streams out into the 
Solar System at speeds of up to 800 kilome- 
tres a second. When the solar wind slams into 
Earth, it generates beautiful polar aurorae, but 
it can also disrupt satellite communications 
and navigation systems. 

“We're going to be right where all the inter- 
esting stuff happens,” says Nicola Fox, a solar 
physicist at the Johns Hopkins University 
Applied Physics Laboratory (APL) in Laurel, 
Maryland, and the mission's project scientist. 

Data from the deep-diving probe should 
allow researchers to improve their under- 
standing of the complex picture of how 
particles, magnetic fields and energy com- 
bine in the Sun. “This is going to be such a 
game-changer,” says Nicholeen Viall, a solar 
physicist at NASA’s Goddard Space Flight 
Center in Greenbelt, Maryland. 
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The Parker Solar Probe will travel seven times closer to the Sun’s surface than any previous spacecraft. 


Space physicists have dreamt of a mission 
that would fly through the solar corona, or 
at least travel inside the orbit of Mercury, the 
innermost planet, since 1958. In the same year, 
Eugene Parker — the physicist at the Univer- 
sity of Chicago in Illinois for whom the probe 
is named — first proposed the existence of the 
solar wind’. 

After decades on the drawing board, the 
mission is finally approaching launch. Eight 
weeks after lift-off, it will fly past Venus, using 
the planet's gravity to slow down and slip into 
atighter orbit around the Sun. Five weeks after 
that, on 3 November, the probe will make its 
first close approach — at more than 24 million 


kilometres, or 35 times the solar radius, from 
its surface. 

From there, the spacecraft will loop around 
the Sun, drawing gradually closer as it flies 
past Venus six more times. That trajectory 
will give the probe ample time to gather data, 
says Yanping Guo, an engineer at APL who 
designed the mission's trajectory. 

Somewhere between the first close approach 
(at 35 solar radii) and its final ones (within 
10 solar radii) the probe will encounter the 
Alfvén surface, a boundary at which the solar 
wind becomes supersonic. Inside the Alfvén 
surface, the Sun’s magnetic field dominates; 
outside, the solar wind is more detached and 
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streams away on its own. 

Crossing that boundary with a spacecraft will 
be similar, symbolically, to the moment when 
the Voyager 1 probe entered interstellar space 
in 2012, says Justin Kasper, a physicist at the 
University of Michigan in Ann Arbor who has 
studied the Alfvén transition’. The moment will 
mark humanity's passage to another realm in 
the Solar System. “I'm confident that something 
special will happen,’ he says. 

The Parker Solar Probe bristles with an array 
of instruments designed to sample the corona 
directly. Protecting them is a 2.4-metre-wide 
heat shield made of 11-centimetre-thick 
carbon foam sandwiched between layers of 
carbon composite. It can withstand tempera- 
tures of nearly 1,400°C. The solar panels that 
power the spacecraft will be kept cool with a 
water-tubing system similar to a car's radia- 
tor. During the searing conditions of close 
approach, most of the solar panels will fold 
back to shelter in the heat shield’s shade. 

Mission scientists hope that the Parker Solar 
Probe will kick off a new era of studying the 
Sun. In 2020, the European Space Agency 
plans to launch its Solar Orbiter spacecraft, 
which will study the Sun at higher latitudes 
and from a more distant point in space than 
the Parker Solar Probe will. Also by 2020, the 
Daniel K. Inouye Solar Telescope will come 
online in Hawaii, where it will make daily maps 
of the solar corona. 

For his part, the 91-year-old Parker is 
looking forward to seeing the waves and turbu- 
lence in the solar wind — which he predicted 
— measured by the probe that bears his name. 
“T expect to find some surprises,” he says. = 
1. Fox, N. J. et al. Space Sci. Rev. 204, 7-48 (2016). 


2. Parker, E. N. Astrophys. J. 128, 664-676 (1958). 
3. Kasper, J. C. et al. Astrophys. J. 849, 126 (2017). 


Philippines sweetens deal for 
scientists who return home 


But some academics say more needs to be done to train and retain early-career researchers. 


BY ANDREW SILVER 


renewed government effort to draw 
Aisne scientists back to the Philip- 

pines by paying them to set up their 
own research labs or teach has met with 
a mixed reception. 

Some Filipino researchers applaud the goals 
of the effort. The government says it needs to 
bring research expertise back home to solve 
some of the country’s most pressing problems, 
such as climate-change mitigation. 


Others suggest that resources would be 
better directed to mentoring early-career 
scientists before they think about leaving. 
“If you want Filipino scientists, you grow 
them,” says Vena Pearl Bongolan, an applied 
mathematician at the University of the 
Philippines Diliman in Quezon City. 

An existing project designed to address the 
issue is the Balik Scientist Program, established 
in the 1970s. Since 1993, it has offered research 
grants for up to three years, as well as round- 
trip airfare and duty-free equipment imports, 
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to Filipino scientists willing to return home. 
Last month, President Rodrigo Duterte 
signed a law that instructs the Department 
of Science and Technology to allocate more 
money to the Balik programme by add- 
ing benefits for returning scientists, such as 
a monthly housing allowance, medical insur- 
ance and assistance for researchers’ children 
to attend schools of their choice. Between 
2007 and April 2018, 207 scientists joined the 
Balik programme, some for stints of a couple 
of months and others for several years. 
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> The programme cost 173 million Philip- 
pine pesos (US$3.2 million) from 2007 to the 
end of 2017. 

The department has not yet established 
exactly how the changes to the programme 
will work and how much extra money it will 
get. Several sectors are listed as high-priority 
recruitment areas, including space, energy, 
artificial intelligence and agriculture and food. 


GROWING DIASPORA 

In 2013, the Philippines had only 
187.7 scientists per million people, one of the 
lowest densities of researchers in the region. 
The latest figures from the Department of 
Science and Technology estimate that the 


The Philippine government wants to boost the number of agricultural scientists in the country. 


number of Filipino science and technology 
workers who moved overseas jumped from 
9,877 in 1998 to 26,917 in 2013. 

The revamped Balik programme is part of 
a government plan to increase development in 
the country, which came into effect last year. 
The government aims to boost the scientific 
workforce to 300 researchers per million 
people by 2022. 

Biologist Michael Velarde, a current Balik 
scientist at the University of the Philippines 
Diliman, says that the extra support for the 
programme is a good idea because the coun- 
try needs more overseas-trained researchers 
to address challenges including preventing the 
spread of diseases such as Zika and studying 


how climate change affects health. 

And he thinks the added financial incentives 
will attract more talent back to the Philippines. 

But Bongolan is frustrated that the country 
is paying to relocate scientists from abroad 
rather than offering more scholarships that 
encourage local students to pursue science at 
university. 

She participated in the Balik programme 
in July 2008 after studying and working in 
the United States for a decade, and is scepti- 
cal about whether some participants are in 
the Philippines for long enough to contribute 
meaningful research and development. 

The Department of Science and 
Technology did not respond to Nature’s 
request for details on what participants had 
achieved. But a department spokesperson said 
each scientist sets goals for their time in the 
country, and that all participants so far have 
met their targets. A statement posted on the 
department’s website last month noted that 
the Balik programme had “significantly con- 
tributed to the acceleration of the scientific, 
agro-industrial and economic development 
of the country”. 

Miguel Garcia, a Filipino neuroscience and 
economics PhD student at the University of 
Zurich in Switzerland, says that the Philip- 
pines needs more than money to lure research- 
ers back from overseas. Scientists need access 
to the right facilities and other researchers in 
their discipline; these are lacking in his field, 
he says. 

Garcia also thinks the government should 
be encouraging scientists of any national- 
ity to work in the Philippines. “Why tie your 
research interests based on someone's national- 
ity when the government could set up the kind 
of research it needs, and attract scientists — 
regardless of nationality — to do it?” he asks. m 
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Mega-publisher cuts 
off German scientists 


Negotiations with Elsevier have stalled over open access. 


BY HOLLY ELSE 


scientists in Germany from reading its 

recent journal articles, as a row esca- 
lated over the cost of a nationwide open-access 
agreement. The move comes just two weeks 
after researchers in Sweden lost access to the 
most recent Elsevier research papers, when 
negotiations on a contract there broke down 
over the same issue. 


ee last week stopped thousands of 
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Negotiators in Germany and Elsevier now 
seem to be waiting for the other to blink, says 
Joseph Esposito, a publishing consultant in 
New York City. The highly public nature of the 
stand-off means that “any deal Elsevier does 
with them becomes the de facto deal for the 
entire world’, he adds. 

Elsevier’s move to cut off some German 
researchers also provides a test as to whether 
the scientists can survive without a subscrip- 
tion deal with the mega-publisher, says Ralf 
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Schimmer, director of scientific information 
at the Max Planck Digital Library in Munich, 
Germany. “If it comes to hardship and misery, 
then the negotiators might be forced back to 
the negotiating table?” His organization pro- 
vides journal access to the dozens of Max 
Planck Institutes and their libraries, and its 
contract with Elsevier finishes at the end of 
this year. 

Elsevier declined to comment on the move, 
which was reported by negotiators, some 
affected libraries and Germany’s national 
university association. The Amsterdam-based 
company instead reiterated a 5 July state- 
ment saying it was committed to reaching a 
deal with the German consortium Projekt 
Deal, which is brokering an agreement on 
behalf of hundreds of Germany’s universi- 
ties and research organizations. Projekt Deal 
declared on 6 July that it had suspended talks 
with Elsevier. The publisher produces more 
than 2,500 journals, which issue in excess of 
400,000 papers each year. 


SUBSCRIPTION SHUTDOWN 

The stand-off was sparked by talks between 
Elsevier and Projekt Deal, which is pushing 
to create a collective subscription agreement 
to replace the individual deals each institution 
has held with the publisher. 

This new type of deal would offset the cost 
of publishing under open-access terms against 
the price paid for subscriptions to paywalled 
journals. These ‘read and publish’ contracts 
have become popular in recent years, as 
governments in some European countries 
have tried to make the fruits of publicly funded 
science open to all. 

Academic library consortia in Austria, the 
Netherlands, the United Kingdom, Sweden 
and Finland have all struck read-and-pub- 
lish deals with various publishers — includ- 
ing Wiley, Springer Nature and Taylor and 
Francis — that cover varying parts of their 
portfolios. (Nature’s news team is editori- 
ally independent of its publisher, Springer 
Nature.) Last month, the Massachusetts Insti- 
tute of Technology in Cambridge became the 
first US institution to enter into one, covering 
journals published by the UK Royal Society 
of Chemistry. But some national consortia 
are now coming up against fierce resistance 


to such contracts from Elsevier. 

In May, talks collapsed between Elsevier 
and the Swedish Bibsam Consortium, which 
brokers deals on behalf of 85 institutions 
across the country. Their existing contract 
expired on 30 June, and some researchers in 
Sweden have now lost access to all Elsevier 
journal articles published after this date. 


TOUGH COMPROMISE 

Negotiators in Germany and Sweden want all 
their papers published in Elsevier journals to 
be open access as part of any new contracts. 
They have said that they will not pay more 
than they did previously for subscriptions. 
But, until now, the Dutch publisher has offered 
other countries read-and-publish deals that 
cover only a small proportion of a country’s 
publishing output. 

Apart from a brief shutdown in early 2017, 
which affected about 70 German institutions, 
Elsevier has provided mostly uninterrupted 
access to German institutions whose contracts 
have expired, while negotiations continue. 
Around 200 are thought to be affected by the 
latest switch-off, according to Projekt Deal. 

The affected universities and research insti- 
tutes can still source missing Elsevier articles 
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through inter-library loans from the 150 or so 
institutes whose contracts have not yet expired. 

The pressure on Elsevier to accept a 
read-and-publish contract is increasing, says 
Bernhard Mittermaier, head of the central 
library at the Julich Research Centre in 
Germany and a member of the Projekt Deal 
negotiating team. He sees the widespread 
shift to ‘gold’ open access — whereby journals 
make papers freely available once published 
— as inevitable. “Publishers would be well 
advised to take it into their own hands and flip 
by themselves,” he says. “Science in Germany 
will not break down?” 

Esposito thinks that the German institutes 
have leverage over Elsevier because their 
researchers can access papers on the illicit 
sharing site Sci-Hub — but that does not 
necessarily mean that Elsevier will back down. 

“The Germans insisted on conducting this 
negotiation in public,” he says, which could 
lead to a no-deal scenario because other coun- 
tries might now want the same. If the stand-off 
continues without resolution, Elsevier will be 
closely monitoring journal article submissions 
from researchers based in Germany, Esposito 
predicts. “Ifit drops sharply, Elsevier will likely 
reconsider its position.” = 
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Big Bang telescope finale is 
end of an era in cosmology 


Collaboration behind Europe’s Planck mission releases its final maps of the early Universe. 


BY DAVIDE CASTELVECCHI 


transformative era in cosmological 
Aw ended last week when the 

European Space Agency’s Planck 
telescope released its final maps of the early 
Universe. Planck was the last in a line of three 
major space telescopes to study the cosmic 
microwave background (CMB), the faint 
afterglow of the Big Bang, resulting in the 
most precise measurements yet of the age, 
geometry and composition of the cosmos. 
With space agencies in Europe and the United 
States hesitant to fund a follow-up mission, 
Planck looks set to be the last CMB-focused 
satellite for many years — marking a big 
change for cosmologists. 

“There’s a whole generation of young 
scientists who grew up with Planck,” says 
cosmologist Jan Tauber, the mission's project 
scientist with the European Space Agency 
(ESA) in Noordwijk, the Netherlands. 

For more than two decades, scores of 


ground-based and balloon-borne experiments 
have also studied the CMB. They have largely 
focused on mapping minute variations in the 
CMB’s temperature across the sky to create 
charts of the Universe that have become the 
gold standard of cosmology. Planck, which 
collected data from 


2009 to 2013, helped “There’s awhole 
researchers to pin generation of 
down the age of the young scientists 
Universe (about whogrewup 


13.8 billion years), its with Planck.” 
geometry (essentially 

flat) and its composition (95% dark matter and 
dark energy). In particular, the latest release 
solidifies an earlier prediction based on Planck 
data that the Universe should be expanding 
9% slower than is currently observed (see 
go.nature.com/2jtOsbi). 

The temperature maps and the science they 
produced have been a “great achievement” but 
they don't have much more to give, says Peter 
Coles, a theoretical cosmologist at Maynooth 
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University in Ireland who is not part of the 
collaboration. 


PLANCK DIASPORA 

Many scientists who worked on the mission 
have already moved on to other projects. Silvia 
Galli, who help to lead the latest study, became 
part of Planck in 2013 after her PhD and is one 
of the few dozen scientists left. Now, she says 
she will probably join many of her colleagues 
who are working on Euclid, a major European 
mission to map the Universe's galaxies on 
an unprecedented scale that is preparing for 
launch in 2021. Euclid is an optical telescope, 
not a microwave detector, which makes it a 
technically different kind of mission. On a 
personal level, it is exciting to move on to new 
endeavours, she says. 

But the prospect of no major CMB mission in 
the pipeline worries many researchers. “Scien- 
tifically, it would be a disaster,” says Galli, who 
is based at the Institute of Astrophysics in Paris 
(IAP). “There isa risk that alot ofknow-how > 
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> and expertise that had been accumulated 
willbe lost” 

The focus of CMB research 
is now to make detailed 
measurements of other 
parameters, including 
polarization: a slight 
tendency for the micro- 
waves electromagnetic 
fields to align in par- 
ticular directions. In 
this, researchers hope to 
find a signature of inflation, 
the brief period at the begin- 
ning of the Big Bang during which 
the Universe would have expanded expo- 
nentially. Planck mapped polarization — as 
did a NASA telescope in the 2000s — but with 
limited sensitivity. “Only 10% of the informa- 
tion in the polarization has been exploited,” 
says Jacques Delabrouille, an astrophysicist at 
the University of Paris Diderot who helped to 
design Planck. “The CMB still has plenty of 
secrets to yield” 


THE POLARIZATION PROBLEM 

NASA and ESA have so far declined to 
fund big new satellites to study the CMB — 
although several US groups are working on 
ground and balloon instruments to measure 
polarization. One reason for space agencies’ 


The Universe’s cosmic microwave background 
measured by the Planck satellite. 


reticence to fund big CMB projects is that 
experiments have yet to find a polarization 
signature of inflation. An experiment called 
BICEP2 claimed to have detected that signa- 
ture in 2014, but Planck data later showed that 
it was just dust in the Milky Way. In the latest 
studies, the Planck collaboration also looked 
for an inflation signature but didn't detect one. 
However, “the fact that we haven't seen it yet 
doesn’t mean that it’s not there’, says cosmolo- 
gist Richard Gott of Princeton University in 
New Jersey. 


Many of the US teams are now joining 
forces and seeking funding to build a 
US$400-million, next-generation 
network of ground telescopes 
called CMB-S4, which will 
be much more sensitive 
to the inflation signa- 
ture than anything that 
has come before. That's 
the next big thing on the 
horizon, Tauber says. 
And many CMB 
researchers continue to 
advocate for space missions. 
Planck cosmologist Erminia Cala- 
brese of Cardiff University, UK, is pushing 
for Europe to join LiteBIRD, a proposed Japa- 
nese probe that would look for the signature 
of cosmic inflation while keeping costs down. 
From space, it would have the advantage of 
seeing the entire sky — something that can- 
not be done from the ground. 

Other cosmologists are trying to jump-start 
CMB-polarization research in Europe. Davide 
Maino, a senior Planck member at the Uni- 
versity of Milan in Italy, is collaborating on an 
Italian-led polarization experiment that will 
be based on the Spanish island of Tenerife. If 
ground-based experiments see even a hint of 
inflation, that will encourage space agencies to 
fund major missions, researchers say. m 
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Malaria is a ticking time bomb in southeast Asia. | 
Scientists are now racing to stamp it out before a 
unstoppable strains spread. 


BY AMY MAXMEN 


Behe? 
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Malaria has been on the rise in northeastern Cambodia, where people live in open homes in rural, mosquito-ridden areas. 


hrongs of men and women ventured into the 
forests of northeastern Cambodia in April, 
lured by a bumper crop of a rare tropical 
treat called samrong. After days of hiking 
through the wilderness, some of the travel- 
lers returned to their homes with a bounty of 
the wrinkled seeds, which fetch a high price 
as a special dessert or an ingredient in tradi- 
tional Chinese medicine. But many soon fell 
ill. Khong Chhoem, a 56-year-old rice farmer, says the fevers hit him 
a few days after the expedition. His muscles hurt. His eyes hurt. He 
had unbearable nightmares. A health worker told Chhoem that he 
had tested positive for Plasmodium falciparum, the deadliest kind of 
malaria-causing parasite. But because a wave of malaria was sweeping 
through the region, medicine was in short supply. Chhoem eventually 
found a shop that carried the drugs he needed, and he recovered. But 
in the intervening days, mosquitoes probably sucked up the parasites 
in his blood and spread them to other people. 

After years in decline, malaria infection rates seem to be on the 
rise in northeastern Cambodia, where people are moving deeper into 
lush, mosquito-ridden territories in search of timber and seasonal 
goods such as samrong (Scaphium affine). Their movements provide 
opportunities for P falciparum — which requires both human and 
insect hosts — to thrive. There are other contributors as well, such as 
treatment delays that allow the parasites to linger and spread, and an 
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alarming decline in the potency of gold-standard malaria drugs called 
artemisinin-based combination therapies (ACTs). 

What happens next here matters for the entire world; malaria 
remains one of the biggest killers in low-income countries. Estimates 
of the number of deaths each year range from 450,000 to 720,000 — 
and ACT pills keep that toll from being much higher. And although 
southeast Asia accounts for just 7% of malaria cases worldwide, it 
has a notorious history as the breeding ground for strains of malaria 
parasites that resist every drug thrown at them and then spread to 
other regions. 

In 2015, reports of drug resistance prompted the governments of 
five countries in the Greater Mekong Subregion — Cambodia, Thai- 
land, Vietnam, Laos and Myanmar — to pledge to banish P falciparum 
from the region by 2025. Together with the World Health Organiza- 
tion (WHO), the countries drew up plans and budgets. This year, the 
nations’ governments have committed US$41 million towards the 
effort; the Global Fund to Fight AIDS, Tuberculosis and Malaria also 
backed elimination efforts in the region, with a 3-year, $243-million 
grant. Donors such as the Bill & Melinda Gates Foundation and the 
Asian Development Bank will add more than $20 million to the fight 
this year. 

But the rise of cases in northeastern Cambodia shows how difficult 
getting to zero will be — and how crucial. As long as P. falciparum 
exists, it can resurge. And the last parasites remaining are the hardest 
to find. They reside in the hinterlands, borderlands and war zones 
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— and in people who show no signs of the disease. “Malaria is very 
clever — it hides out where you don’t know and comes back when you 
aren't ready,’ says Ladda Kajeechiwa, a malaria-programme manager 
at a branch of the Mahidol Oxford Tropical Medicine Research Unit 
(MORU) in Mae Sot, Thailand. 

To eliminate malaria, public-health officials are trying to cover the 
region with volunteers and health workers who can dispense malaria 
drugs promptly, and report any signs of an upsurge. They are also 
looking to researchers for tools that can forestall drug resistance, 
technologies that improve detection of the parasite, and evidence on 
strategies for rooting it out. The pressure is on. “Everybody has fears of 
malaria resurgence,” says Myaing Nyunt, a malaria researcher at Duke 
Global Health Institute in Durham, North Carolina. She worries that 
donors will tire of the cause, and that the people in the Greater Mekong 
region will grow to resent the obsession with malaria when they have 
more pressing needs. Yet if the effort lets up, malaria will roar back. 

“That is why it is so important to eliminate it as quickly as possible,” 
Myaing Nyunt says. 


THE RISE OF RESISTANCE 

Southeast Asia has served as a launch pad for drug-resistant 
malaria multiple times, with disastrous consequences. In the 1950s, 
Cambodian authorities reported that chloroquine, the main cure for 
malaria at the time, had stopped clearing some infections. Within 
three decades, chloroquine-resistant strains had spread to Africa’, 


Near Cambodia’s border with Laos, people often travel on small tractors to harvest timber and other goods from the forest, where they can contract malaria. 
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know and comes back when 
you aren’t ready.” 


where 90% of the world’s malaria cases occur, and the death toll started 
to climb. By 2000, malaria was killing about one million people a year 
globally. Doctors said it was like watching a couple of Boeing 747 
planes crash every day. 

Around 2005, the trend started to reverse as ACTs became 
available around the world. The main ingredient in these treatments, 
artemisinin, rapidly wipes out the majority of parasites, and a longer- 
acting partner drug in the combo mops up the rest. When ACTs 
are working as they should, the number of parasites detectable in 
a finger-prick of blood drops by half within three hours of taking 
the medicine. 

But in 2008, researchers in western Cambodia reported” that 
parasite numbers in some patients were not falling as quickly as they 
ought to. To monitor the situation, scientists throughout the region 
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A technician in Siem Pang, Cambodia, prepares blood cells for shipment to Bangkok, where biologists will study the malaria parasites living in the cells. 


started shipping blood samples to high-tech labs in cities including 
Bangkok, Singapore and London. It was through such work that 
researchers uncovered a secret weapon that parasites had gained 
against artemisinin’ — mutations in a gene called Kelch 13. They 
found that several Kelch 13 mutations had popped up in the Greater 
Mekong Subregion, and the share of parasites harbouring such 
mutations was rising. Resistance to artemisinin didn't cause a rise in 
deaths, partly because of the partner drugs. But some of those drugs 
had begun to falter, too. One lineage of P._ falciparum from Cambo- 
dia was found to contain both a powerful Kelch 13 mutation and a 
gene duplication that helps it to survive treatment by the partner 
drug piperaquine*. Last October, molecular biologists reported’ that 
this parasite had spread to Thailand, Laos and Vietnam. Now, some 
members of that family also resist another partner drug, mefloquine® 
(see ‘A crucible for resistance). 

Drug resistance has a way of growing from a nuisance to a serious 
problem. Although an incompletely cleared infection might not killa 
person, the parasites that survive can go on to infect others and their 
lineage grows stronger. To stay a step ahead of resistance, researchers 
at MORU renovated a dilapidated clinic in Siem Pang in northeast- 
ern Cambodia earlier this year. They are expanding a trial to test a 
combination of artemisinin with two partner drugs, rather than one. 
That combination should kill parasites that haven't yet made it to the 
triple-resistant stage. But trial leader Arjen Dondorp, deputy director 
of MORU in Bangkok, says that the only certain way to halt resistance 
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“All the investments we’ve 


made so far here and in Africa 
will amount to an incredible 
waste of money.” 


is to rid the Mekong region of malaria altogether. At the moment, 
the WHO considers elimination feasible because there are relatively 
few infections in southeast Asia. But the opportunity to do so wont 
last long: as ACTs start failing, case numbers will rise. “This is the 
scenario we predicted and now we see it unfolding in Cambodia,” 
Dondorp says. 

To hasten the elimination of P falciparum in the region, public- 
health officials in the five countries have promised to strengthen 
the basic mechanisms of malaria control, which include main- 
taining a steady force of health workers. But gaps remain in this 
system, particularly in Cambodia and Myanmar, which are among 
the poorest countries in Asia. To help them, Benjamin Rolfe, head 
of the Asia Pacific Leaders Malaria Alliance in Singapore, has been 
garnering funds and political support throughout Asia. “We are 
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so close to the end game that it is not good business sense to let 
drug resistance happen,” Rolfe says. “If we do, all the investments 
we've made so far here and in Africa will amount to an incredible 
waste of money.’ 


A SCOURGE IN HIDING 

Recent findings suggest that the task of elimination will be even harder 
than expected. Health workers typically diagnose malaria using a sim- 
ple rapid diagnostic test (RDT): a pin-prick of blood is added to a test 
strip, which quickly turns red ifit detects proteins from P falciparum. 
The tests adequately distinguish malarial fevers from other infections. 
But Mallika Imwong, a molecular biologist at Mahidol University in 
Bangkok, wondered whether some people might carry too few para- 
sites to make them ill — or to show up on a RDT. To find out, she 
fished for parasite DNA in vials of blood shipped from Cambodia, 
Myanmar and Vietnam, using a sensitive method called real-time 
polymerase chain reaction (PCR). In 2015, she reported’ that only 
one-quarter of Plasmodium infections were detected by the usual 
RDT on average. The disparity was at times remarkable. “I was really 
surprised,” says Imwong. “In one village, only 4% or 5% of the people 
were positive with RDTs, but 68% were infected.” 

Researchers have not yet discovered how or why the parasites can 
persist at low levels in a person’s bloodstream. Whatever the reason, 
there are signs* that these covert microbes multiply when conditions 
are right — such as when the rainy season starts. 
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Knowing that silent infections exist is one thing, but actually finding 
them is quite another. Health officials say it is unrealistic to conduct 
PCR analyses on vials of blood collected across the Greater Mekong 
Subregion. Rather, health workers need something like a rapid diag- 
nostic test that requires no refrigeration, lab equipment or special 
training, but that can detect parasites even when they are not abundant. 

In central Myanmar, a roving team of doctors, researchers and 
technicians are piloting one such technology. Thura Htay, a young 
physician and public-health researcher swerves on his motorcycle over 
rugged red-dirt roads with a green plastic supply bag gripped between 
his thighs. He is pensive, overseeing his group on its mission to collect 
pin-pricks of blood from 357 people in 3 remote villages by the end of 
the day. In these mountains, the things most likely to thwart the mission 
are flash floods and signs of wild elephants. (When startled by the sound 
of motorbikes, the animals sometimes attack people, homes and farms.) 

Today, the path remains clear and the team arrives at the Shwe Kyin 
village before noon. A few dozen people linger ona concrete platform, 
where tables, chairs and bowls of pickled tea leaves, fried nuts and 
sweets have been neatly arranged in anticipation of the team’s visit. 
Although no one from the village has been diagnosed with malaria this 
year, Thura Htay says, “I want to know if asymptomatic malaria is here”. 

A teacher ina rhinestone-studded top holds out her finger toa team 
member. He pricks it and places a drop of blood onto a conventional 
diagnostic test, and then onto an experimental version that the team 
is assessing to see whether it is more sensitive. Then he pricks another 
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Forests along the Mekong river and its tributaries provide a breeding ground for mosquitoes carrying malaria. 


finger and presses it against two kinds of filter paper — one intended 
to capture DNA, and the other, proteins. The team swiftly hangs the 
papers ona clothes line to dry. 

The teacher, Khin Zar Lwin, says she’s happy to help scientists 
learn about her village, but the emphasis on malaria by this and other 
groups feels like overkill; few people here get the disease, and there are 
other pressing health concerns. Pregnant women and elderly people 
who need urgent care, for example, must travel for hours on foot and 
then by boat down a tumultuous river to reach the nearest hospital, 
in the town of Singu. 

In the late afternoon, the team packages the tests and blood spots 
into coolers strapped to the back of motorcycles, and returns to 
a concrete building in Singu. Team members hang the papers in 
a pair of cupboards, cooled and screened off to protect against 
mould and flies. Thura Htay sleeps on a bed beside them to ensure 
they are safe. 

Every few weeks, the team ships samples to a renovated lab at the 
government’s Department of Medical Research in Yangon. There, 
researchers compare the results of the two rapid diagnostic tests with 
results from a PCR analysis on blood spots collected on one of the 
filter papers. 

By the end of this year, Thura Htay — and his boss, Myaing Nyunt 
— will know whether the new diagnostic tests can be used to map 
out the regions where symptomatic and asymptomatic malaria 
occur. Such ‘hotspot’ maps will help officials to decide on elimination 
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“We work day by day, year by 
year, without guarantees on 
what happens tomorrow.” 


strategies for different regions. But the question remains of how to 
handle communities with hidden parasites. One possible answer 
comes from eastern Myanmar. 

A few years ago, researchers mapped the malaria hotspots in Karen, 
a state along Myanmar’s border with Thailand that has resisted gov- 
ernment rule since the country gained independence from Britain 
seven decades ago. (A symbol of that is its rejection of the state's offi- 
cial name, Kayin.) Conflict has made many residents mistrustful of 
outsiders. But when MORU’s Ladda Kajeechiwa visited one of the 
hotspots in May this year, an old man in the village of Ta Oo Hta 
greeted her warmly. 

The man, Bo Lay, recalls the toll that malaria took during the 
community’s darkest days. Burmese troops and armed groups 
attacked Ta Oo Hta several times, he says, and burnt it to ashes at 
least once. Each bout of violence drove people into the mountains, 
where they slept among the snakes and leeches. Those who became 
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severely ill with malaria trekked through the jungle and crossed 
the Moei River to refugee clinics along the Thai border in Mae Sot. 
“Sometimes they died along the way,’ he says. 

Bo Lay was glad when he heard about MORU’s plan to wipe out 
malaria, and he discussed it with his neighbours, easing their suspi- 
cions of outsiders asking to draw blood and deliver drugs. This plan 
consisted of controlling malaria — and then pressing the accelerator on 
elimination. Beginning in 2014, MORU and other local organizations 
strengthened a network of 1,222 village malaria workers by training 
and paying them to treat and report any malaria cases that they find. In 
addition, they analysed blood samples across the state with PCR to find 
pockets of malaria. They then asked every healthy person in a hotspot 
— about 4% of Karen’s population — to swallow ACTs once a month 
for three months’. In surveys after the intervention, the researchers 
found that the incidence of malaria in the state had dropped by 90%. 
The only stronghold of the disease is in a rugged region in the north, 
where armed separatist groups rarely accept outside interventions. 

“We have strong evidence that elimination can happen,” says 
Frangois Nosten, a French researcher who founded MORU’ facility in 
Mae Sot, the Shoklo Malaria Research Unit, in 1986. But to his frustra- 
tion, governments in the region have put mass-drug-administration 
strategies on hold for a number of reasons. 

For instance, it takes a great deal of work to convince people who 
don't feel ill to take medicines that can cause fatigue and nausea, and it 
requires funds to pay and organize health workers to survey populations 
for the parasite and distribute drugs. Ifa significant number of people 
dontt take the full course of therapy, parasite reservoirs remain — and 
some researchers fear that this situation could foster drug resistance. 

Karen's nearly disease-free status remains tenuous as long as 
malaria exists in the broader region. And its defence against imported 
cases relies on continued international aid because there is no stable 
health system in Karen. At the moment, the Wellcome Trust, the 
Global Fund and the Bill & Melinda Gates Foundation provide funds 
for malaria workers that ensure that any cases that pop up in the state 
are treated before the parasites have a chance to spread. But if donors 
put their funds elsewhere — or fighting breaks out again in Karen, as 
it has in the past — the system will fall apart and malaria will resurge. 
“The NGOs [non-governmental organizations] could pull out, the 
political situation is unstable,’ Kajeechiwa says. “We work day by 
day, year by year, without guarantees on what happens tomorrow.’ 

A similar fragility exists in other states along Myanmar’s border, 
such as Kachin in the north and Rakhine in the west, the site of 
the ongoing Rohingya crisis. Since August 2017, more than half 
a million members of the Muslim Rohingya ethnic group have 
fled killings, arson and rape at the hands of Myanmar’s troops and 
nationalistic Buddhists. In these areas of the country, aid organiza- 
tions and private clinics handle malaria because the public-health 
system is fractured. That concern resonates with Saw Nay Htoo, 
programme director of the Burma Medical Association, a non-profit 
organization working with minority ethnic groups. In an office in 
Mae Sot where vines penetrate the panelled ceiling, he says,“we want 
to eliminate malaria, but first we need peace”. 


THE LAST MILE 
Myaing Nyunt believes the common enemy of malaria could help 
to unite warring factions. She manifests that hope. A political dissi- 
dent who fled from Myanmar’s military government in 1988, Myaing 
Nyunt now works with government researchers there. Squeezed into 
the back seat ofa jeep beside Zaw Lin, an official in Myanmar’s Vector- 
Borne Disease Control department in Nay Pyi Taw, she explains why 
she decided to partner with researchers who are part ofa government 
that has been accused of human-rights violations. Because there is no 
quick fix for malaria — no powerful vaccine — the fight against the 
disease requires a sustainable health system. As a scientist, she says 
she can develop top-notch malaria-elimination approaches and tools, 
but those won't matter if governments do not put them into practice. 
Always frank, Nyunt says that the decision to work closely with 
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Blood spots collected in Myanmar will be used to validate diagnostic tests that 
detect the malaria parasite with more sensitivity than current technology. 


the government has made her research slow and hard. There are 
bureaucratic meetings to attend, labs to be built from scratch and 
students who are in need of long-term mentorship and stipends. 
“We are talking about a health system that has been underfunded 
and underdeveloped since the 1960s, and that’s not going to change 
overnight,’ she says. “But you have to do it this way because changes 
must come from within” 

If the new diagnostic that she and Thura Htay are road-testing in 
central Myanmar works, Zaw Lin will help the government to decide 
whether to pre-emptively distribute ACTs in hotspots, as was done in 
Karen, or to routinely screen people in those places with the new test, 
and treat those who test positive. He doesn't shy away from talking 
about the country’s conflicts. In addition to political instability, he 
and other public-health officials find themselves pushing to sustain 
enthusiasm for their mission amid other health concerns. But he has 
seen at first hand how malaria bounces back if it’s not obliterated. 
It resurged in Myanmar in the 1990s, just as it seems to be doing in 
Cambodia and several other countries now. 

Losing inertia in the Mekong region, the motherland of drug 
resistance, could be a disaster. “This is the last-mile challenge in front 
of us, Zaw Lin says. “If we dont eliminate malaria, deaths can shoot 
up like a hidden bomb?’ = 


Amy Maxmen writes for Nature from San Francisco, California. 
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An engineer inserts a lithium-ion battery from an electric vehicle into a testing system for recycling. 


Ten years left to redesign 
lithium-ion batteries 


Reserves of rare metals used in electric-vehicle cells are dwindling, so boost research 
on iron and silicon alternatives, urge Kostiantyn Turcheniuk and colleagues. 


lectric vehicles need powerful, light 
E and affordable batteries. The best 

bet is commercial lithium-ion cells 
— they are relatively compact and stable. 
But they are still too bulky and expensive 
for widespread use. 

The performance of rechargeable lithium- 
ion batteries has improved steadily for two 
decades. The amount of energy stored ina 
litre-sized pack has more than tripled, from 
around 200 watt hours per litre (Wh) to 


more than 700 WhI". Costs have fallen by 
30 times, to around US$150 per kilowatt 
hour (kWh). But that still exceeds the $100 
per kWh goal for affordability set by the 
US Department of Energy. And batteries 
that are powerful enough for an electric car 
(50-100 kWh) still weigh around 600 kilo- 
grams and take up 500 litres of space. 

The pace of advance is slowing as 
conventional technology approaches fun- 
damental limits. The amount of charge that 
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can be stored in gaps within the crystalline 
structures of electrode materials is nearing 
the theoretical maximum. Projected market 
growth will not lower prices substantially — 
the markets are already large. 

Worse, the materials used in electrodes, 
notably rare metals such as cobalt and nickel, 
are scarce and expensive. Surging battery 
production has almost quadrupled whole- 
sale prices of cobalt over the past two years, 
from $22 to $81 per kilogram. 
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The high cost of extracting metals from low-concentration ores pushes up their market price. 
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>» High demand and prices are already 
encouraging some producers to cut corners 
and violate environmental and safety regula- 
tions. For example, in China, dust released 
from graphite mines has damaged crops 
and polluted villages and drinking water". 
In Africa, some mine owners exploit child 
workers and skimp on protective equip- 
ment such as respirators. Small artisanal 
mines, where ores are extracted by hand, 
often flout laws. Some companies, includ- 
ing BMW, follow strict policies to verify their 
cobalt suppliers’. Many do not. 

Alternative types of electrode based on 
cheap, common metals such as iron and 
copper need to be developed urgently. In our 
view, the most promising candidates involve 
‘conversion materials, such as copper or iron 
fluorides and silicon. These store lithium 
ions by bonding chemically with them. 
But the technology is still at an early stage. 
Problems with stability, charging speed and 
manufacture must be overcome. 

We call on materials scientists, engineers 
and funding agencies to prioritize the 
research and development of electrodes based 
on abundant elements. Otherwise, the roll- 
out of electric cars will stall within a decade. 


SCARCE AND EXPENSIVE 

Lithium-ion batteries work by shuffling 
lithium ions between two electrodes. Ions 
flowing from the anode to the cathode dis- 
charge a current, which powers the car. The 
lithium ions flow back when the battery is 
recharged. 

In commercial cells used today for electric 
vehicles, the lithium ions are held in tiny 
voids within the crystals that make up the 
electrodes (these are known as intercalation 
electrodes). The anodes are typically made 
from graphite and the cathodes from metal 
oxides. Common oxides include lithium 
nickel cobalt aluminium oxide (NCA, 
commonly LiNiy sCo9 ;;Aly 950) or lithium 
nickel cobalt manganese oxide (NCM, often 
LiNiy <CO92Mny 0, or LiNiy sCo9;Mny ,O,). 
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A lithium-ion car battery with a 100 kg 
cathode requires 6-12 kg of cobalt and 
36-48 kg of nickel. 

The prices of metals reflect demand, 
supply and the costs of extracting them 
from ores. Cobalt is pricey because it is rare 
and highly sought after. It requires capital- 
intensive processes to produce it, involving 
roasting, flash smelting and the consump- 
tion of poisonous gases’. Cobalt is often a 
by-product of copper and nickel mining, and 
can also need separating from other metals. 

Few cobalt mineral deposits are concen- 
trated enough to be worth mining. Most 
deposits contain just 0.003% of the metal; 
more than 0.1% is needed to achieve prices of 
$100 to $150 per kg (ref. 4). Production costs 
jump for poor ores because more rock must 
be processed (see ‘Metal prices’). Thus, only 
10’ tonnes of cobalt out of 10’° tonnes poten- 
tially available in Earth’s crust are profitable 
to extract*. Similarly, only 10° of 10'° tonnes 
of nickel reserves are commercially viable’. 


ENERGY ADVANTAGE 


Batteries that use conversion electrodes can 
store more energy in a given unit stack volume 
than those using conventional electrodes. 


Conversion electrodes 


FeF,-Si 
FeF,-Si 
CuF,-Si 
FeCl,-Si 


Conventional electrodes 


0 500 1,000 1,500 2,000 
Energy density (watt hours per litre) 


C, carbon (graphite); LFP, lithium iron phosphate; NCA, lithium nickel 
cobalt aluminium oxide; NCM, lithium nickel cobalt manganese 
oxide; LMO, lithium manganese oxide. 
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Cobalt-rich minerals are found in just a 
few places®. The Democratic Republic of 
the Congo (DRC) supplied more than half 
(56%) of the 148,000 tonnes of the metal 
mined worldwide in 2015 (ref. 6). Most of 
this goes to China, which holds stockpiles of 
200,000 to 400,000 tonnes®. Australia hosts 
14% of the world’s cobalt reserves but has 
yet to exploit them fully. Cobalt has been 
extracted from the deep sea floor, but min- 
ing here would be too expensive, ecologically 
and economically. 

Likewise, nickel production is dominated 
by a dozen nations. In 2017, Indonesia, the 
Philippines, Canada, New Caledonia, Russia 
and Australia together supplied 72% of the 
2.1 million tonnes mined globally. Of this, 
less than one-tenth went to batteries; the 
rest was used mainly in steel and electronics. 
Nickel is cheaper to extract than cobalt, 
through a series of reactions with hydrogen 
and carbon monoxide’. Nonetheless, rising 
demand has boosted nickel prices by about 
50% since 2015, from $9 to $14 per kg. 

Both cobalt and nickel have suffered 
sudden price hikes and crashes. For 
example, disrupted Australian supplies, 
increased demand from China for steel and 
speculation by hedge-fund managers led to 
a five-fold surge in the price of nickel and a 
tripling of that for cobalt in 2008-09. 


PROJECTED SHORTFALLS 

If nothing changes, demand will outstrip 
production within 20 years. We expect this 
to occur for cobalt by 2030 and for nickel by 
2037 or sooner. 

Car manufacturers and governments 
project that 10 million to 20 million electric 
cars will be produced each year by 2025. If 
each car battery requires 10 kg of cobalt, by 
2025, electric vehicles would need 100,000- 
200,000 tonnes of cobalt per year — most of 
the world’s current production. Similarly, 
400,000-800,000 tonnes of nickel would be 
required annually, or 20-40% of all the metal 
used today. More would be needed when 
trucks, buses, aeroplanes and ships become 
battery-powered. 

By 2050, producing 50 million to 80 million 
electric vehicles a year would require 
500,000-800,000 tonnes of cobalt. Beyond 
2030, this would far exceed current mining 
capacities. Similarly, 2-3 times more nickel 
would be needed by 2050. Nickel shortages 
would be evident by the mid-2030s. 

Recycling cannot replenish supplies. 
Lithium-ion batteries last for 15-20 years, 
3 times longer than the 5-7 years for lead- 
acid batteries. Refiners might exploit poorer 
quality ores, especially as prices climb. 
But greater processing costs would push 
the prices higher. Once supplies peak, we 
estimate that the price of an electric car bat- 
tery could leap by more than $1,000. ‘Peak 
cobalt’ might be delayed by a few years if 
cathodes can be made using less metal. But 
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A miner enters a copper and cobalt mine in Kawama, Democratic Republic of the Congo. 


cathode materials that use less cobalt, which 
are under development, degrade quicker 
and would need to be replaced more often. 


ABUNDANT MATERIALS 
The answer is to make lithium-ion battery 
electrodes from common metals, such as iron 
and copper. A kilogram of iron, for example, 
costs just 6-9 cents. The global iron reserves 
stand at 76 billion tonnes: easily enough to 
supply the billions of batteries that could 
replace today’s petroleum car engines. 
Already, Chinese electric buses use con- 
ventional cathodes made from iron-rich 
materials such as lithium iron phosphates 
(LFPs; LiFePO,), which can survive many 
cycles of charging. But they are less efficient at 
holding charge than cobalt- and nickel-based 
cathodes (see “Energy advantage’). Twice as 
many iron-based cells*”, at nearly twice the 
cost, are needed to drive the same distance. 
Equally inefficient are commercial elec- 
trodes made from another common metal, 
manganese (such as lithium manganese 
oxide or LMO, commonly as LiMnO, or 
LiMn,O,). Earlier this year, a cathode (based 
on lithium manganese niobium oxyfluoride) 
was demonstrated in the lab that could 
almost match the energy storage capacity of 
cobalt- and nickel-based materials’. But a 


high voltage is needed to charge it, making 
it unsafe for use in vehicles. 

The most promising alternative, in our 
view, is to use conversion materials in elec- 
trodes. Copper and iron fluorides and sili- 
con react with the lithium ions. A transition 
metal in a conversion cathode can host up to 
six times more lithium atoms than one ina 
standard cathode. Such materials swell more 
to accommodate the extra lithium atoms. 
Cell designs must allow for this distortion. 

Future anodes are likely to be based on 
silicon. It is produced from sand and stores 
nearly ten times more lithium ions by mass 
than graphite does. Combining conversion 
cathodes with silicon anodes in the next 
generation of lithium-ion battery cells could 
allow cells to store more than twice as much 
energy as the best conventional ones by vol- 
ume, and more than three times by weight*”. 
Half as many cells would be required to 
power electric vehicles, also halving costs, 
weight and volume. 

Silicon-anode technology is advancing. 
Tesla already uses small amounts of silicon 
in anodes made of graphite for lithium- 
ion cells in its electric vehicles, and BMW 
announced plans to incorporate silicon- 
dominant anodes in its future electric vehi- 
cle batteries. Other companies, too, are 
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developing silicon-rich anode materials. 
These include the Californian firms 
Enevate in Irvine, Enovix in Fremont and 
Sila Nanotechnologies in Alameda (G.Y. isa 
board member, shareholder and chief tech- 
nology officer of Sila Nanotechnologies). 

Metal fluoride cathode technology has yet 
to move beyond the lab’. Conversion-type 
batteries can take 20 hours to power up; this 
needs to be reduced to tens of minutes. They 
also require up to one-third more energy to 
charge; this should be no more than 10%. 
And their stability needs to be improved, 
from 5-500 cycles to 1,000-2,000 cycles. 

Undesirable interactions need to be 
minimized between conversion materials 
and electrolyte substances. And the micro- 
structures and compositions of electrodes 
and electrolytes must be optimized. New 
architectures need to be explored that con- 
tain the swelling and shrinkage of electrodes 
while maintaining their conductivity. 


NEXT STEPS 
Several technological breakthroughs are 
needed to secure the future of affordable 
battery-powered transport. 

First, cell performance needs to improve. 
Materials scientists need to produce high- 
performance fluoride materials for the > 


26 JULY 2018 | VOL 559 | NATURE | 469 


> electrodes. Electrochemists should 
develop more-effective electrolytes. 
Engineers need to develop tools for 
manufacturing these materials. 

Government agencies and leading car 
manufacturers should fund this research, 
which will require billion-dollar invest- 
ments. In our view, the best way to appor- 
tion this money is by targeting projects 
addressing key battery challenges, as the 
US Department of Energy’s Advanced 
Research Projects Agency-Energy 
(ARPA-E) does. Developing batteries 
free from cobalt and nickel should be 
one priority. 

As promising battery materials and 
cell technologies emerge, funding should 
be refocused towards improving their 
characteristics and viability. Production 
processes and costs must be addressed. 
We expect that synthesizing conversion 
electrode materials will require differ- 
ent steps, including forming certain 
nanoscale structures. Treatments involv- 
ing series of solutions and gases, for 
example, might be borrowed from other 
sectors such as food, pharmaceuticals, 
filtration and composite manufacturing. 

Lithium-ion battery manufacturers 
have already invested billions of dollars 
in dozens of ‘giga-factories to boost the 
electric-vehicle market. With co-ordi- 
nation, thought and planning, these can 
be set on a new path to deliver the next 
generation of affordable batteries. m 
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Pet genomics 
medicine 
runs wild 


Genetic testing for dogs is big business. It is too easy for 
companies to Sell false hope, warn Lisa Moses, Steve 
Niemi and Elinor Karlsson. They call for regulation. 


ast year, a 13-year-old dog, let’s call 
L= Petunia, started having trouble 

walking and controlling her bladder 
and bowels. Distressed, her owners bought 
a US$65 genetic test through a direct-to- 
consumer (DTC) company. It suggested that 
the pug carried a mutation that is linked to a 
neurodegenerative condition similar to the 
human disease amyotrophic lateral sclerosis 
(ALS, or motor neurone disease). 

What published data there are’* suggest 
that as few as 1 in 100 dogs that test posi- 
tive for this common mutation will develop 
the very rare disease, and Petunia’s condi- 
tion was also consistent with more-treatable 
spinal disorders. But her owners chose to 
put her to sleep, convinced that she would 
otherwise suffer progressive and irreversible 
paralysis and eventual death. 

Genetic testing for pets is expanding. 
Hundreds of thousands of dogs have now 
been genetically screened, as Petunia was, 
and companies are beginning to offer tests 
for cats. But the science is lagging. Most of 
these tests are based on small, underpow- 
ered studies. Neither their accuracy nor 
their ability to predict health outcomes 
has been validated. Most vets don’t know 
enough about the limitations of the studies, 
or about genetics in general, to be able to 
advise worried owners. 

Pet genetics must be reined in. If not, 
some companies will continue to profit by 
selling potentially misleading and often 
inaccurate information; pets and their own- 
ers will suffer needlessly; and opportunities 
to improve pet health and even to leverage 
studies in dogs and cats to benefit human 
health might be lost. Ultimately, people will 
become more distrustful of science and 
medicine. 


WEAK SCIENCE 

Global spending on pets has grown by 14% 
over the past 5 years (see ‘Animal lovers’), 
and worldwide annual spending on pet 


care is currently estimated to be about 
$109 billion. 
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Genetics is one of the newest additions 
to this booming industry. Worldwide, at 
least 19 laboratories are now marketing 
genetic-testing products. Some vets use 
the results to help to diagnose sick pets, or 
to recommend that healthy ones be subject 
to expensive and sometimes invasive tests, 
such as bone-marrow biopsies. Some dog 
breeders use the tests to try to reduce the 
incidence of inherited diseases. At least 
one US veterinary hospital chain is now 
recommending genetic testing for all dogs, 
saying that the results allow “individual- 
ized healthcare” and can guide behavioural 
training. 

We believe that three major problems 
plague pet genetic testing in its current state. 


Lack of validation. In both humans and 
animals, mapping genetic variants to risk 
of disease is incredibly challenging. But 
most dog genetic tests are based on studies 
of candidate genes, which is a problematic 
approach. In such studies, researchers test a 
handful of human or animal genes for muta- 
tions that both match an expected inheritance 
pattern and seem likely to be pathogenic, for 
instance because they affect the structure of 
an expressed protein. 

In humans, fewer than 2% of candidate- 
gene studies have stood up’ to further 
investigations using more-advanced meth- 
ods, such as genome-wide association 
studies. As a result of these shortcomings, 
geneticists who study humans must now 
bring more evidence to designate a genetic 
variant as ‘disease-causing. Through collabo- 
rative efforts involving industry, academia, 
physicians and patients, every clinical variant 
identified is now scored on a five-point scale, 
from ‘pathogenic’ to ‘benign’**. (Clinical var- 
iants are those that are linked to medically 
important phenotypes.) 

No such careful reassessment has 
occurred in veterinary medicine. Many of 
the 200 tests offered by companies are based 
on only a single small candidate-gene study 
(see, for example, go.nature.com/2nquntx). 


FRANCK ROBICHON/EPA/REX/SHUTTERSTOCK 
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Spending on pet care is increasing around the world. 


The data obtained are rarely made public. 
Neither is the information required to con- 
textualize the results. Providers who have 
followed up with owners to check the accu- 
racy of their tests’ health predictions have 
reported case studies involving a small num- 
ber of dogs®, but nothing on the scale needed 
to provide statistically reliable results’. 


Imprecise results or interpretation. A 
recent analysis of DTC results supplied by 
human genetic-testing companies found 
that, ofall genetic variants presented in peo- 
ple’s test results, 40% were inaccurate®. Other 
variants were reported by DTC companies 
to increase disease risk, but are considered 
benign by independent labs. Finally, the 
companies failed to screen for all known 
disease-linked mutations for any given 
gene, giving an incomplete picture of genetic 
susceptibility to disease. 

The same problems apply to the DTC 
products now offered for pets. Dogs, for 
example, are frequently screened for muta- 
tions in the gene ABCB] that are associated 
with dangerous sensitivity to several com- 
mon drugs. Three different mutations have 
been associated with this phenotype, but 
the documentation provided by genetic- 
testing companies implies that they test for 
only one. Thus, a dog declared ‘lear’ for a 
given gene might still harbour other known, 


clinically relevant mutations in that gene that 
the company has not tested for. 


Conflicts of interest. In the absence of 
effective industry self-policing or govern- 
ment regulations, there are many potential 
conflicts of interest associated with making 
profits from pet owners. As an example, a 
pet health-care corporate database could be 
used to identify the breed ofan owner’s dog. 
The corporation owning the database might 
then notify the clinic (which might be owned 
by the same company) that a genetic test for 
a specific illness is warranted, regardless of 
that test’s medical value. If the test comes 
back positive, the clinic’s vet might recom- 
mend preventive steps, such as specific pet 
foods (made by the same company), periodic 
screening tests (performed by the company’s 
clinical lab), and more-frequent exams (per- 
formed at the company’s vet clinics), even 
though there may be low or no risk of disease 
in the first place. 


TSUNAMI OF DATA 
The companies do provide multiple caveats 
about interpreting genetic test results in their 
terms and conditions. Yet their warnings are 
not as prominent as their claims. And com- 
panies tend to be vague about which variant 
they test for, even in publications’. 

All of this is worrying because veterinary 
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medicine is about to be hit by a tsunami of 
genomic data. Currently, pet owners can 
purchase the tests directly from companies 
and obtain results that ostensibly report on 
risk for more than 100 different diseases 
for less than $200, and some kits provide a 
‘health report’ to take to the vet. Within the 
next five years, owners will have the option 
of whole-genome sequencing for pets of any 
species, using the same technology used for 
human medicine. Such results could accel- 
erate genetic-based diagnoses and even the 
genetic editing of pets. 

In principle — with large, well-powered 
studies, involving perhaps tens of thousands 
of animals — genetic testing could be used to 
predict the risk of common diseases affecting 
millions of pets, including cancer, heart dis- 
ease, diabetes and epilepsy. The use of pets 
in preclinical and clinical research to ben- 
efit humans is growing. For certain diseases, 
such as cancer and diabetes, researchers are 
sometimes using pet dogs as models in pref- 
erence to lab rodents. Although it is harder 
to control environmental variables and to 
obtain high numbers of subjects, the use of 
dogs offers the advantages of spontaneous 
disease models that might be closer to the 
human version. Additionally, pet owners 
voluntarily enrol their pets and continue to 
feed and care for them at home. Thus, many 
of the costs and regulations that govern the 
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A cheek swab is all that’s needed for a genetic test. 


same species when they are used in the lab 
can be bypassed. Such methodological shifts 
make it imperative that the science of pet 
genetics is robust. 

Largely because current regulatory mecha- 
nisms for pets focus on treatments, not diag- 
nostics, there is no oversight of genetic testing 
for pets in the United States or in the Euro- 
pean Union. To our knowledge, neither the 
US Food and Drug Administration (FDA) 
nor the European Medicines Agency (EMA) 
has proposed quality standards, or sought 
public input about the tests’ pros and cons. 

To bring the untamed wilderness of pet 
genetic testing under control, we propose 
five steps. 


Establish standards. All stakeholders, from 
pet owners and the companies providing the 
tests to researchers and regulators, should 
work together to establish standards both for 
testing methodology and for the reporting 
of test results. Samples should be collected, 
stored, shipped and analysed in specified 
ways; the number of animals should exceed 
a certain threshold to support claims made 
about a specific test; and reports should meet 
specified criteria. Reports must be suffi- 
ciently clear, with information about poten- 
tial caveats provided at a predetermined 
point in the document and in easy-to-read 
language. Organizations such as the World 
Small Animal Veterinary Association could 
take the lead on convening meetings with 
this goal. 


Create guidelines. A select group of 
stakeholders — which would comprise 
working groups — could develop guide- 
lines that specify the standards that should 
be adopted. Such a document could be sim- 
ilar to the ethics guidelines” for the use of 
novel technologies in lab-animal research 
developed by the British Small Animal Vet- 
erinary Association science subcommittee. 
Ideally, the instructions would have global 
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reach, and as pet genetic testing grows, 
such guidelines might eventually need to 
become law. 


Share data. Existing pet genetic databases, 
which are produced by industry, academia 
and government agencies, should be shared 
among all parties, with steps taken to ensure 
that individual pets and owners can't be 
identified. Companies might be incentiv- 
ized to share data if the proposed guidelines 
require sample sizes that can be achieved only 
through pooling data. To improve transpar- 
ency, stakeholders can draw on the knowl- 
edge accumulated across the past two decades 
by scientists researching human genetics, who 
have been grappling with issues around data 
sharing and patient privacy. 


Recruit tools and expertise. Bioinfor- 
maticians, computer scientists trained in 
machine learning, and others with exper- 
tise in big data will be needed to manage and 
analyse the tremendous volume of incoming 
information. Such specialists might be per- 
suaded to enter the burgeoning field of pet 


ANIMAL LOVERS 


In the United States and Europe, spending on pet 
care has steadily grown over the past six years. 
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genetics because of the potential financial 
gains. Or they could be lured by the possi- 
bility of pet genetics offering an inroad into 
solutions to human disease. 


Educate counsellors. A cadre of pet genetic 
counsellors needs to be established from 
among vets and others working in animal 
care. These professionals would provide 
support and advice to pet owners follow- 
ing genetic tests and could potentially be 
affiliated with their counterparts in human 
genetic counselling at leading academic 
medical centres. 


DATA ETHICS 

Other issues might require ongoing public 
dialogue and the appointment of expert advi- 
sory boards. For instance, should for-profit 
pet-testing firms have unfettered access to all 
the results they generate, for internal corpo- 
rate use or to sell to other companies engaged 
in pet care — as is the case now? Can an 
owner, who might want to breed their pet and 
sell the offspring, keep any results secret (as 
they can currently) or should they be required 
to divulge this information to buyers? 

In the United States alone, some 70% of 
households own pets. Done right, the use of 
genetic testing in companion animals could 
be a powerful way to better connect people 
to the possibilities of genetics for treating 
disease. Done wrong, it could erode trust in 
science for an increasingly sceptical public. m 
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German cabinet-maker David Roentgen’s eighteenth-century automaton of Marie Antoinette plays a table-top zither. 
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The automaton chronicles 


Our responses to robots echo down the millennia, say Stephen Cave and Kanta Dihal. 


he French philosopher René Descartes 

| was reputedly fond of automata: they 

inspired his view that living things 

were biological machines that function like 

clockwork. Less known is a strange story 

that began to circulate after the philosopher's 

death in 1650. This centred on Descartes’s 

daughter Francine, who died of scarlet fever 
at the age of five. 

According to the tale, a distraught 
Descartes had a clockwork Francine made: a 
walking, talking simulacrum. When Queen 
Christina invited the philosopher to Sweden 
in 1649, he sailed with the automaton con- 
cealed in a casket. Suspicious sailors forced 


the trunk open; when the mechanical child 
sat up to greet them, the horrified crew 
threw it overboard. 

The story is probably apocryphal. But it 
sums up the hopes and fears that have been 
associated with human-like machines for 
nearly three millennia. Those who build 
such devices do so in the hope that they will 
overcome natural limits — in Descartes’s 
case, death itself. But this very unnatural- 
ness terrifies and repulses others. In our era 
of advanced robotics and artificial intelli- 
gence (AI), those polarized responses per- 
sist, with pundits and the public applauding 
or warning against each advance. Digging 
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into the deep history of intelligent machines, 
both real and imagined, we see how these 
attitudes evolved: from fantasies of trusty 
mechanical helpers to fears that runaway 
advances in technology might lead to crea- 
tures that supersede humanity itself. 
Arguably the oldest known story of some- 
thing approximating AI can be found in the 
eighth-century-Bc Iliad, Homer’s epic poem 
of the Trojan War. In it, Hephaestus, disa- 
bled god of metalworking, creates golden 
handmaidens to help him in his forge: “In 
them is understanding in their hearts, and 
in them speech and strength, and they know 
cunning handiwork”. Hephaestus was > 
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> also supposedly responsible for the first 
‘killer robot? Talos. A mechanical bronze 
colossus featuring in the third-century-Bc 
epic Argonautica, it patrolled the shores of 
Crete, lobbing boulders at invaders. 

These fictions were grounded in reality: 
ancient Greek technologists were astonish- 
ingly skilled in mechanics and metalwork. 
In her forthcoming book Gods and Robots, 
classicist Adrienne Mayor describes bronze 
automata that featured at the Olympic 
Games two centuries before the Argonautica 
— a leaping dolphin and eagle in apparent 
flight. In his first-century-ap treatise On 
Automaton-Making, the mathematician- 
engineer Hero of Alexandria describes a 
fully automated puppet theatre that, through 
a combination of displaced grain, axles, 
levers, pulleys and wheels, could enact an 
entire tragedy. 

These classical stories reveal how, then 
as now, humanoid machines were mostly 
conceived as representing straightforward 
hopes — the ideal servant who always 
obeys, the perfect soldier who never tires. 
But as the influence of Greece declined 
over the first centuries AD, the West 
entered a millennium in which the skills of 
automaton-making were lost, along with 
the aspirations associated with them. As 
historian E. R. Truitt describes in Medieval 
Robots (2015), it was the Byzantine Empire 
and Arab world that preserved the mechani- 
cal arts over those centuries. Around AD 
850, for instance, the Banu Musa brothers 
in what is now Iraq published the Book of 
Ingenious Devices, which featured automata 
such as a water-powered organ. Thus, in the 
West, the ‘otherness’ of mechanical simu- 
lacra was compounded. Associated with 
the exotic and the idea of an ‘infidel’ East, 
automata were viewed for some time with 
awe and suspicion. 

The thirteenth century saw a resurgence 
of Western interest. Automata began appear- 
ing at courts as showpieces designed to wow 
visitors. The Count of Artois at the chateau 
of Hesdin in what is now northern France 
commissioned a cluster of mechanized 
beasts and androids that interacted with 
guests, scolding them and or even soak- 
ing them with water. At the same time, 
hints of darker themes began to emerge. A 
number of great medieval scholars, such as 
Roger Bacon and Albertus Magnus, were 
each rumoured to have created a bronze 
head that could answer any question — a 
proto-Siri. These stories end badly, with the 
oracle’s destruction, sometimes by people 
who mistrust it. In such cautionary tales, 
the creation of AI is an act of Promethean 
hubris, semi-divine power no mortal should 
possess — presaging the story of Descartes's 
daughter and Mary Shelley’s Frankenstein 
(see R. Holmes Nature 535, 490-492; 2016). 

The march of mechanical humanoids 
nevertheless persisted into the great cultural 
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and technological flowering of the Renais- 
sance. Evoking widespread wonder, hydrau- 
lic, spring-powered and clockwork automata 
proliferated in Europe, from mechanized 
angels in churches to Neptune automata in 
grottoes. Their mystical connotations began 
to give way to reflections of cutting-edge 
scientific developments: Leonardo da Vinci, 
for instance, drew up plans for a robotic 
knight operated internally by weights and 
pulleys. 

Over subsequent centuries, Descartes’s 
view of living things as complex machines 
inspired new heights in manufacture, 
and Europe reached peak automaton in 
the seventeenth to early nineteenth cen- 
turies. Master artisans built marvels of 
art-imitating-life, such as Jacques de 
Vaucanson's 1739 Digesting Duck. Sporting 
more than 400 moving parts in each wing 
and internal rubber tubing, it appeared to 
eat, drink and defecate. 
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But the duck had no ‘digestive’ mecha- 
nism: its ‘droppings’ were actually pre- 
fabricated pellets ejected from a hidden 
compartment. A few decades later in 
1770, inventor Wolfgang von Kempelen’s 
infamous chess-playing Mechanical Turk 
debuted. Although many suspected it was a 
hoax, nobody could quite work out how they 
were fooled until a revelatory article in 1857 
announced that it was operated by a person 
hidden inside. Awe over the transgressive 
nature of such mechanisms began to mingle 
with fears of deception, as reflected in the 
fiction of the time. In Prussian Romantic 
author E. T. A. Hoffmann’s 1816 short story 
“The Sandman, for example, the protagonist 
Nathanael is bewitched by the beauty of a 
woman called Olimpia. The discovery that 
she is an automaton drives him to suicide. 

The imaginative resonance of intelligent 
machines began to reach its fullest in the 
twentieth century, when industrialization 
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replaced the rhythms of nature with those 
of the production line. It was also a time of 
revolution and mechanized warfare. Against 
that backdrop, the word ‘robot’ was born in 
Czech writer Karel Capek’s 1920 play R.U.R. 
(Rossum’s Universal Robots). In the very work 
coining the term, the robots rebel against 
and destroy their creators. And that narra- 
tive of rebellion has proved to be the most 
potent of all our AI fears, retold repeatedly 
as technology evolves. 

During the cold war space race, the film 
2001: A Space Odyssey (1968) gave us HAL 
9000, the murderous spaceship supercom- 
puter. With the rise of the Internet, we got 
Skynet — a defence network that becomes 
self-aware in the Terminator films (start- 
ing in 1984) — and The Matrix (1999), 
featuring intelligent machines that farm 

humans whose 


“The tension minds unknow- 
liesin our ingly inhabit avir- 
conflicted desire ‘vl reality. Now, 
tocreatebeings With AI dominat- 
superhumanin 28 headlines, we 
capacity, but have sophisticated 
subhuman in robots again over- 
status.” throwing their wet- 


ware masters, from 
Ava in the 2015 
film Ex Machina to the android amusement- 
park hosts in the Westworld television series. 

The persistent trope of robot revolts 
reveals the paradox at the heart of our rela- 
tionship with intelligent machines. We want 
to create clever tools that can do everything 
we can do, and more. They will be the per- 
fect oracles, servants, soldiers, even lovers. 
To fulfil our hopes, they must have attributes 
such as intellect and agency — minds of their 
own, superior to ours. But, paradoxically, 
that is also why we fear HAL and Skynet. 
The tension lies in our conflicted desire to 
create beings superhuman in capacity, but 
subhuman in status. 

Our hopes continually threaten to 
collapse into such fears, but we hope nev- 
ertheless. Every robot rebel has its benevo- 
lent counterpart, such as C-3PO in the Star 
Wars franchise or the android child David 
in Steven Spielberg's 2001 film A.I. Artificial 
Intelligence. Both kinds of stories, the hope- 
ful and the fearful, reveal to us our complex 
emotional responses to AI. Understand- 
ing these and their deep history is crucial 
to making the most of life with intelligent 
machines. = 
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Books in brief 


The Immeasurable World 

William Atkins FABER & FABER (2018) 

An “axis where the absolute coexists with the infinite”: so 

William Atkins sees the quintessence of deserts. He searched for it 
on seven journeys, from the Arabian Peninsula’s ‘Empty Quarter’ to 
China’s Gobi Desert, recollecting his arid odyssey in this beautifully 
observed memoir-cum-natural-history. With him, we tread the grit 
of old atomic test sites in Australia’s Nullarbor Plain; meet fatalistic 
migrants at Arizona’s Mexican border; and contemplate the bleak 
ecodisaster of Central Asia’s disappearing Aral Sea. “The world has 
been done,” Atkins declares. But he sees it with new eyes. 


Ticker: The Quest to Create an Artificial Heart 

Mimi Swartz CROWN (2018) 

In this pacy, blow-by-blow account of the search for a viable artificial 
heart, Texas Monthly executive editor Mimi Swartz offers a vivid gallery 
of the medical pioneers who have jostled for the prize. Here are heart 
surgeon O. H. ‘Bud’ Frazier; inventors Robert Jarvik, Billy Cohn and 
Daniel Timms; and cardiac specialists Michael DeBakey and Denton 
Cooley, whose duel over implanting the first crude devices sparked 
the crusade. Decades later, that quest remains urgent: more than 

25 million US citizens have heart disease, yet only a few thousand 
hearts are available for transplant in any given year. 


Dreamers, 


and R, V 'Sionaries 
*volutionaries 


Dreamers, Visionaries, and Revolutionaries in the Life Sciences 
Oren Harman and Michael R. Dietrich (eds) UNIV. CHICAGO PRESS (2018) 
They may infuriate even as they inspire; destabilize, as well as advance. 
The visionaries of biology, note science historians Oren Harman and 
Michael Dietrich, are masters of “dramatically bold, even fantastic” 
thinking about big problems. This compelling edited volume explores 
the work of 19 innovators, including lranian-American cancer 
researcher Mina Bissell, who studies tumour microenvironments; 
Canadian John Todd, who engineered solar aquatic sewage treatment; 
and Peter Kropotkin, the Russian biologist who championed 
mutualism in nature during the Darwinian revolution. 


One of Ten Billion Earths 

Karel Schrijver OXFORD UNIV. PRESS (2018) 

Like distant mirrors, exoplanets reveal much about Earth and its 
habitability even as they relay explosive new insights into planetary 
formation and extraterrestrial life. Astrophysicist Karel Schrijver’s 
lucid, fact-packed primer ranges over everything from the Goldilocks 
zone and stellar nurseries to disrupted exosystems and the vagaries 
of “living on a pale blue dot”. And more: if you ever wanted to 
understand nomad planets or catch a glimpse of a volcanic eruption 
on Jupiter’s moon lo (observed by the New Horizons spacecraft on 
its way to Pluto), this book delivers the goods. 


The Omega Principle 

Paul Greenberg PENGUIN (2018) 

The cacophony of claims about the impact of omega-3 fatty acids 
on health is drowning out concerns over the associated industry, 
argues Paul Greenberg in this sobering investigation. He followed 
the omega-3 trail around the world and into history to reveal the 
price of a century of industrial reduction — the boiling down of a 
gargantuan tonnage of marine organisms for products from animal 
feed to oils for supplements. Notably, the mass harvesting of krill 
and forage fish such as menhaden looks increasingly unsustainable 
as overfishing and oceanic warming take their toll. Barbara Kiser 


26 JULY 2018 | VOL 559 | NATURE | 475 


© 2018 Springer Nature Limited. All rights reserved. 


Correspondence 


Earth- observation 
data must stay free 


US President Donald Trump’s 
administration is considering 
whether to charge for access to 
Earth-observation data (Nature 
556, 417-418; 2018). As former 
director of the Group on Earth 
Observations (GEO), I urge the 
US government not to falter in 
its global leadership in open- 
data policies or to dismiss the 
importance of their worldwide 
benefits, particularly for the 
United States. 

As you point out, this would 
not be the first time that the 
United States has charged for 
Landsat satellite data. Until 2008, 
it used a cost-of-fulfilling-user- 
requests model, bringing in 
about US$4.5 million annually. 
The largest purchaser of data was 
the federal government itself, 
so its net profit was negligible. 
Meanwhile, external research 
and innovation withered. 

Once replaced by an open- 
access policy, economic returns 
followed: in 2011, for example, 
these were $1.7 billion for the 
United States and $400 million 
for other countries, bringing 
the global total to $2.1 billion. 
Today, there are more than 
400 million international open 
Earth-observation data and 
information resources (see www. 
geoportal.org). 

We must remain committed 
to keeping Earth-observation 
data — collected at taxpayers’ 
expense — freely and readily 
available to inform policy 
decisions, including those called 
for in the Sendai Framework 
for Disaster Risk Reduction, the 
United Nations 2030 Agenda for 
Sustainable Development and 
the Paris climate agreement. 
Barbara J. Ryan Bonita Springs, 
Florida, USA. 
bjrgeneva@gmail.com 


Predict trajectory, 
not emergence 


It is important to distinguish 
between predicting the 
emergence of infectious 


diseases and their trajectory 
(see E. C. Holmes et al. 
Nature 558, 180-182; 2018). 
It is currently impossible to 
predict emergence, but feasible 
and effective to predict the 
trajectory. 

For example, modelling 
of established outbreaks was 
important for responding to 
those caused by the Ebola, Zika, 
HIN1 influenza and severe acute 
respiratory syndrome (SARS) 
viruses. It helped decision- 
makers to plan bed capacity (M. 
I. Meltzer et al. MMWR Morb. 
Mortal. Wkly Rep. 65, 85-89; 
2016), anticipate travel-related 
spread (I. I. Bogoch et al. Lancet 
385, 29-35; 2015) and plan 
vaccine trials (A. Camacho et al. 
Vaccine 35, 544-551; 2017). 

Several US government 
agencies maintain such 
capabilities, including the 
Centers for Disease Control 
and Prevention, and the 
Biomedical Advanced Research 
and Development Authority. 
And the White House Office of 
Science and Technology Policy 
published a report in 2016 that 
identified epidemic prediction 
and forecasting asa crucial 
tool in outbreak preparedness 
and response (see go.nature. 
com/2ublwjt). 
Caitlin M. Rivers Johns Hopkins 
Center for Health Security, 
Baltimore, Maryland, USA. 
Samuel V. Scarpino 
Network Science Institute, 
Northeastern University, Boston, 
Massachusetts, USA. 
crivers6@jhu.edu 


Biomedical success 
on the cheap 


A handful of science students 

at Bellevue College, a small 
community institution in the 
United States, have pioneered 
their own training in biomedical 
research methods — with 

no wet lab facilities and ona 
minimal budget. 

Our group identified a 
promising project by tapping 
into advances in computational 
analysis and the vast, readily 


accessible amounts of genomics 
data. We decided to search for 
viral protein sequences that 
could be important for tissue 
tropism. 

Human pathogenic viruses 
were classified according to the 
tissue they infect (pulmonary, 
gastrointestinal, and so 
on) — irrespective of whether 
they were enveloped or 
non-enveloped RNA or DNA 
viruses. The students majoring 
in computer science developed 
programs to speed up protein- 
sequence comparisons and 
alignments in each category. 
The biologists identified 
consensus protein domains. 

Within a month, the students 
had localized a consensus 
amino-acid sequence on 
the adherence spikes of the 
chikungunya virus (CHIKV). A 
literature search revealed that 
this sequence is responsible 
for binding to a receptor on 
human cells and infecting them 
(R. Zhang et al. Nature 557, 
570-574; 2018). They then 
came up with the idea that this 
domain could be used as an 
antigen for producing a vaccine 
against CHIKV. Clinical 
trials of virus-like particles 
are ongoing (see go.nature. 
com/2nzgvpw and go.nature. 
com/2lhwjrh). 

It is heartening that a 
group of young scientists in a 
community institution, who 
had never before heard of 
CHIKYV, achieved so much as 
a result of their curiosity and 
motivation. 

Reza Forough” Bellevue College, 
Bellevue, Washington, USA. 
reza.forough@bellevuecollege.edu 
*On behalf of 4 correspondents; 
see go.nature.com/2ubuchn 


Health-care impacts 
of web shutdowns 


Governments in Africa, 

Asia and the Middle East are 
increasingly shutting down the 
Internet as a blunt response to 
threatening situations such as 
political unrest (see go.nature. 
com/2msruv2). We call on 


inter- and non-governmental 
organizations to put pressure 
on governments to evaluate the 
potentially devastating effects 
of this tactic on health care. 

In Cameroon, for example, 
the government cut off the 
Internet for three months last 
year. This disrupted crucial 
services such as the GiftedMom 
smartphone app, which 
connects rural mothers with 
physicians and guides them 
through infant vaccinations, 
in a country where children 
are almost twice as likely as the 
global average to die before 
their fifth birthday. 

Furthermore, Cameroon 
was unable to submit 85% of 
its health-performance data to 
the District Health Information 
System data set DHIS2, which 
is used to guide funding 
decisions. Health workers 
received only partial pay 
because they could not report 
their progress to the World 
Bank’s performance-based 
financing system. 

Yet no empirical research 
exists for the health impacts 
of such shutdowns. We need 
to know, for example, the 
risks they pose to emergency 
services and to funding for 
Internet-enabled medicine. 
Policymakers must have 
comprehensive data on 
interruptions to online services 
if health outcomes in the 
developing world are to be 
improved. 

Okwen Mbah, Miriam Nkangu 
Effective Basic Services (eBase) 
Africa, Nkwen Bamenda, 
Cameroon. Zak RogoffAccess 
Now, Washington DC, USA. 
okwen@ebaseafrica.org 
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An early hominin arrival in Asia 


An excavation has uncovered stone tools in China that are about 2.1 million years old. This evidence pushes back the date of 
the earliest established signs of a hominin species outside Africa. SEE LETTER P.608 


JOHN KAPPELMAN 


This theme is often explored in post- 

apocalyptic science fiction, and it was 
the case for most of Earth’s history until early 
hominins (members of the evolutionary tree 
that includes humans, extinct species of the 
genus Homo and other closely related bipedal 
species) moved out of Africa into unknown 
territories, to eventually populate the planet. 
Which hominins made this journey? When 
and how did they migrate, and by which 
routes? Field-based research that uncovers 
traces of these early dispersals could provide 
some answers, as well as insight into hominin 
behaviour. On page 608, Zhu et al.' report evi- 
dence of hominin activity in China more than 
2 million years ago, revealing an earlier time 
frame than was previously known for a homi- 
nin presence outside Africa. 

Establishing the first known occurrence 
of a species requires incontrovertible evi- 
dence to confirm the presence of the species 
and firm support for the age of the geologi- 
cal material that contains such a sample. The 
remains of species that were common where 
they lived are usually found with ease in fossil 
deposits. By contrast, populations of ancient 
hominins were probably rare, and their fossil- 
ized remains are generally scarce. Just a single 
finger bone can suffice to document a hominin 
presence’. However, when hominins began to 
fashion tools from stones, at least 3.3 million 
years ago’, the purposely chipped cobbles and 
flakes that they produced became another call- 
ing card that can attest to a hominin presence. 

Until now, the oldest known hominin site 
outside Africa* was in Dmanisi, Georgia. Exca- 
vations at that site uncovered spectacular finds 
of the roughly 1.85 million- to 1.78-million- 
year-old remains of multiple hominins and 
stone tools. Numerous later sites of hominin 
activity, at locations stretching from western 
Europe to eastern Asia’, have also been inves- 
tigated thoroughly. Zhu and colleagues’ report 
of signs ofa hominin presence at Shangchen in 
China’s Loess Plateau (Fig. 1) is based on evi- 
dence from only stone tools, and the research- 
ers found that these tools were distributed 
in layers of sediment that date back to about 
2.1 million years ago. 

The age of many hominin sites has been 


[== a world nearly devoid of humans. 
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Figure 1 | Ancient sites of hominin presence. Zhu et al.' report their discovery of approximately 
2.1-million-year-old stone tools at Shangchen in China's Loess Plateau, which provides the earliest known 
evidence for the presence of hominins (the evolutionary group that includes humans, extinct species 

of the genus Homo and related bipedal species) outside Africa. The dates (Myr, million years ago) and 
locations of some of the important earliest known sites of hominin fossils and stone tools are shown. 


estimated by methods such as radiometric 
dating, or by the chemical fingerprinting of 
volcanic rocks. Because Shangchen lacks vol- 
canic rocks, Zhu and colleagues instead used 
palaeomagnetic dating to analyse sediment 
layers called palaeosols (fossilized soils) and 
loess (wind-blown silts). This technique relies 
on the fact that Earth’s magnetic field under- 
goes random reversals’, in which the magnetic 
north pole becomes the magnetic south, and 
vice versa. Magnetic minerals in sediments act 
like small compasses that register the polarity, 
and when such sediments become rock, the 
polarity of that time is locked in. The pattern 
of polarity reversals in these ancient sedi- 
ment layers provides a fingerprint that can 
be matched to a dated reference called the 
geomagnetic polarity timescale (GPTS)’. 

The site at Shangchen contains steep, deeply 
incised outcrops that include slopes covered 
in vegetation, which makes sample collection 
a literal cliffhanger. The authors’ data provide 
a convincing correlation to the GPTS, and the 
team was able to identify two notable events 
from the GPTS (the brief Réunion Subchron’ 
and the later, but slightly longer, Olduvai 
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Subchron), which bookend the layer that con- 
tains the oldest artefacts. Can we be certain 
that the researchers found the oldest tools at 
the site? The team’s ability to reach the deeper 
layers was limited because of the present active 
farming of the fields. Investigating such layers 
should be a goal of future work. 

The 96 excavated stone tools that Zhu et al. 
analysed were mainly small stone flakes and 
cobbles, which contrasted sharply with the 
fine-grained material that surrounded the 
artefacts. The authors propose that hominins 
transported these rocks from the Qinling 
Mountains to the south. If the exact source 
of the stones could be determined, it would 
provide important information about how far 
these hominins transported raw materials. 

The stone tools are simple in form: the 
palm-sized rock cores had a limited number 
of flakes removed, with some of the flakes 
showing an apparent subsequent resharp- 
ening, and several cobbles might have been 
used as hammerstones. These characteristics 
closely align the Shangchen tools with those 
ofa similar age found in Africa’®. The authors 
do not report any matching refit of flakes to 


each other or to the cores from which they 
were chipped, so flake production might have 
occurred elsewhere. However, because the 
steepness of the slopes at the site rarely per- 
mitted the team to open large excavations, this 
prospect remains to be examined thoroughly. 

What were the stone tools used for? 
Zhu et al. note the discovery of animal remains 
near the oldest tools, including bones belong- 
ing to bovids (a family that includes antelopes 
and cattle), cervids (comprises deer) and suids 
(pigs). The authors did not address whether 
this association provides evidence of tool use 
for carcass processing. To evaluate the possi- 
bility, it would be necessary to identify signs 
such as: cut marks on the bones that point 
to flesh removal with tools; breakage marks 
on the bones, suggesting that they had been 
hammered to extract bone marrow; tool wear; 
or the presence of trace biological residues on 
the tools. If this question is investigated in the 
future, the degree of post-recovery cleaning of 
the stone tools might have compromised the 
use of approaches" such as residue studies. 

Hominins originated in Africa possibly 
more than 6 million years ago’”. The absence 
in Eurasia of both any hominin sites dating to 
the early portion of this interval and any fossils 
that can be attributed to hominin genera such 
as Australopithecus and Paranthropus, found in 
Africa until about 1 million years ago, points 
instead to a species of Homo as the most 
likely candidate for the first hominin to have 
left Africa. The oldest known African fossil 
attributed to Homo is a 2.8-million-year-old 
jawbone from Ethiopia’, which provides a 
time estimate for the earliest possible exit of 
the genus from Africa. Of course, the actual 
date of departure might have been later. 

The hominin dispersal probably occurred 
under the variable climates of the Pleistocene 
ice age. Does a migration to higher latitudes 
suggest the evolution of behavioural adapta- 
tions to colder climates? Perhaps. The con- 
ventional interpretation that the palaeosols of 
the Loess Plateau formed during favourable 
warm and wet conditions, and its loess under 
harsher cold and dry conditions, is probably an 
oversimplification”’, but at the Shangchen site, 
palaeosol layers containing stone tools out- 
number loess layers containing such tools by 
a ratio of about 2:1. Rather than maintaining 
a continuous occupation of the Loess Plateau, 
the hominin population might have increased 
or dwindled, depending on the climate’. 

The roughly 14,000-kilometre trek from 
eastern Africa to eastern Asia represents a 
range expansion of dramatic proportions. The 
dispersal of hominins was probably facilitated 
by population increases as they moved into 
new territories and filled empty niches, and 
could also have been driven by the phenom- 
enon of resource depletion that underlies the 
high mobility of today’s hunter-gatherers'®. Yet 
even with a dispersal rate of only 5-15 kilo- 
metres per year, a value well inside the daily 
foraging range of modern hunter-gatherers””, 


the distance between Africa and Asia could 
have been covered in just 1,000-3,000 years. 
The present record of hominin sites and the 
dating techniques that are currently available 
to researchers are not sufficient to resolve a 
dispersal event of such potential speed, or to 
determine its exact form, but we can surely 
look forward to more finds that will help to 
solve this migration mystery. m 
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Quantum optics 
without photons 


Atoms can exhibit wave-like behaviour to form matter waves. Such waves have 
been used to model the basic processes that underpin how light interacts with 
matter, providing an experimental platform for future research. SEE LETTER P.589 


ALEJANDRO GONZALEZ TUDELA 
& J. IGNACIO CIRAC 


he fundamental theory that describes 
the interaction between light and 
matter at microscopic scales is known 
as quantum optics. One of the most striking 
and tangible consequences of this theory is 
that excited atoms in the quantum electro- 
dynamic vacuum — a state that contains 
no photons, often referred to simply as the 


Potential- 
energy well 


Atom 


vacuum — can decay to their ground state 
by emitting a photon, a process called spon- 
taneous emission. In 1946, physicist Edward 
Mills Purcell proposed that this process can be 
tailored by structures that alter the photonic 
environment’, such as photonic crystals’, 
engineered dielectrics (insulators) in which 
light cannot propagate at certain frequency 
ranges. In the 1990s, it was predicted? that 
spontaneous emission in photonic crystals 
leads to exotic decays, in which photons 


Figure 1 | Matter-wave emitters. Krinner et al.* trapped rubidium atoms in a one-dimensional optical 
lattice formed by the interference of laser fields — the interference generates a periodic pattern of light 
intensity that corresponds to a series of potential-energy wells, in which atoms can be confined. Each 
atom has two internal states, one of which (red) is confined to the atom and the other (blue) which is 


unconfined and overlaps with adjacent empty wells. By tuning the parameters of the system, the authors 
could make occupied lattice sites emit atoms as waves (dotted arrows) that travel along the lattice. The 
emission mimics that of photons from atoms confined in photonic crystals (materials engineered so that 
light cannot propagate at certain frequency ranges). 


26 JULY 2018 | VOL 559 | NATURE | 481 
© 2018 Springer Nature Limited. All rights reserved. 


| RESEARCH | NEWS & VIEWS 


adopt a ‘superposition’ state in which they are 
simultaneously emitted into the surrounding 
environment but also localized around the 
emitting atoms. On page 589, Krinner et al.* 
report the first observation of the dynamics 
of these exotic decays, not using photons, but 
using a system of trapped, ultracold atoms. 

The atoms in Krinner and colleagues’ 
experiments are trapped in optical lattices’, 
which are formed by the interference of 
counter-propagating laser fields — the inter- 
ference generates a periodic pattern of light 
intensity in which the atoms are confined. 
Given their quantum-mechanical nature, the 
atoms can tunnel between neighbouring sites 
in the lattice at a rate that can be adjusted by 
altering the lasers’ intensities. Because such 
systems are highly controllable and exhibit 
low decoherence (the atoms are well isolated 
from their environment and thus behave ide- 
ally), they are an almost perfect platform for 
simulating complex quantum problems found 
in fields such as condensed-matter and high- 
energy physics (see go.nature.com/2uied19). 
Krinner et al. now demonstrate that trapped- 
atom systems can also be used to simulate 
quantum optical problems. 

The authors’ experiments are based on a 
proposal® published in 2008. The idea is to use 
atoms — in this case, rubidium atoms — that 
have two internal states, which respond to a 
one-dimensional optical lattice in different 
ways (Fig. 1). One state (let’s call it the f state) 
‘sees’ regions of high light intensity as deep 
optical potential-energy wells from which it 
cannot move, whereas the other (the a state) 
hardly notices the wells, so that atoms in that 
state can propagate through the optical lat- 
tice as a matter wave. An atom in state f thus 
represents a matter-wave emitter in an excited 
state, whereas an atom in state a behaves likea 
photon that can be emitted through spontane- 
ous emission. To complete the analogy with 
atomic decay phenomena in a structured pho- 
tonic environment, the two internal states are 
coupled to each other using external fields, so 
that an initial excitation can be transformed 
into a propagating matter wave. 

In their experiments, Krinner et al. model 
the simplest scenario of spontaneous emission 
in a photonic crystal, but one that potentially 
offers the most insight into such processes: 
the spontaneous emission of a single photon 
into the vacuum. By tuning the experimental 
parameters of their system, they observed a 
phenomenon known as fractional decay” ; 
in which the emitter ends up in a quantum 
superposition of being both excited and hav- 
ing decayed to the ground state. The authors 
also report direct evidence that the probabil- 
ity of the emitter remaining in the excited 
state does not decrease exponentially over 
time. Both the non-exponential behaviour 
and the fractional decay are some of the most 
peculiar effects that photonic crystals induce 
in quantum emitters. The new measure- 
ments are analogous to previously reported 
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measurements of the spontaneous emission of 
photons in the visible-light”* and microwave” 
regions of the electromagnetic spectrum, but 
in those studies, it was not possible to measure 
the dynamics of the decay. 

The authors’ experimental platform offers 
several useful features in addition to its excel- 
lent controllability and low decoherence. 
First, it is flexible enough to emulate photonic 
crystals that have different geometries, and 
to model 3D environments in which further 
unusual features of spontaneous emission 
emerge’””’. It also allows access to param- 
eter regimes that are out of reach of optical 
implementations, such as situations in which 
the coupling between emitters and the envi- 
ronment is very strong. Moreover, it might 
enable spontaneous emission to be studied 
in environments that are even more exotic 
than photonic crystals, such as in materials 
known as topological insulators — although 
the experimental set-up would need to be 
adapted so that the atoms move in the way that 
simulates the movement of excitations in such 
materials, and the efficiency with which decays 
can be detected would need to be improved. 

Together with photonic platforms in the 
optical and microwave regimes, Krinner and 
colleagues’ system opens the way to studies of 
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the physics that emerges in unconventional 
quantum optical set-ups. If the experiments 
are extended to include many emitters, it might 
be possible to observe collective spontaneous- 
emission phenomena that cannot be predicted 
using current computational methods, or even 
to engineer interactions among the emitters that 
cannot be produced using other platforms. m 
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Viruses cooperate to 
defeat bacteria 


It emerges that viruses called phages, which infect bacteria, can suppress the 
bacterial immune system during an initial wave of unsuccessful infection, 
enabling subsequent viral infection to succeed. 


AUDE BERNHEIM & ROTEM SOREK 


acteria and the viruses that infect them, 
known as phages, are engaged in a con- 
stant arms race. Bacteria continually 
evolve new mechanisms of resistance against 
viruses, while phages evolve countermeas- 
ures to overcome these defence mechanisms. 
Writing in Cell, Borges et al.' and Landsberger 
et al.” reveal how phages can ‘collaborate’ to 
shut down the bacterial immune system and 
achieve a successful infection. Although an ini- 
tial viral attempt at infection fails, this enables 
a subsequent phage infection to be successful. 
This example of ‘cooperation between geneti- 
cally identical individuals of a viral popula- 
tion illuminates a previously unknown group 
strategy of phages, and provides an interesting 
example of viral ‘altruism. 
The bacterial anti-phage defence system 
called CRISPR-Cas recognizes and targets 
foreign nucleic acids in a sequence-specific 
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manner’. To block CRISPR-Cas defences, 
phages express genes that encode proteins 
that inhibit the function of the CRISPR-Cas 
machinery*”. Because these anti-CRISPR 
proteins are encoded in the phage genome 
(Fig. 1), a phage must enter a bacterial host 
cell and begin to express them to mount a 
counter-attack. Yet this raises a conundrum. 
CRISPR-Cas defences can attack phage DNA 
as soon as it enters a bacterial cell°, before 
the phage gets a chance to express and use 
its anti-CRISPR proteins. So what purpose 
do anti-CRISPR proteins serve if the help 
they provide is likely to arrive too late for the 
individual virus that expresses them? 

To investigate this, Borges, Landsberger and 
their respective colleagues studied the bacte- 
rium Pseudomonas aeruginosa and examined 
viral anti- CRISPR proteins that target a version 
of the bacterial CRISPR-Cas system called 
type I-F. Both groups observed that the initial 
ratio between the number of phages and the 
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Figure 1 | A viral mechanism to thwart bacterial 
defences. The bacterial protein and RNA complex 
termed CRISPR-Cas recognizes and destroys foreign 
DNA, such as that of invading bacteria-infecting 
viruses called phages, in a sequence-specific manner’. 
Some phages encode anti-CRISPR proteins that bind 
to CRISPR-Cas and inhibit its function*’. However, 
how such proteins aid viruses has been unclear, given 
that an initial infecting phage is probably targeted and 
destroyed by CRISPR-Cas, even if the phage expresses 
anti-CRISPR proteins before it is eliminated. Borges 
etal.' and Landsberger et al.” provide evidence that 
an initial unsuccessful phage infection that leads 

to expression of anti-CRISPR proteins generates 

an immunosuppressed bacterium that could then 
succumb to a subsequent phage infection, thereby 
facilitating successful phage replication. 


number of bacterial host cells, known as the 
multiplicity of infection (MOI), could be used 
to predict whether phage infection would be 
successful. When the authors tested bacteria 
that lacked CRISPR-Cas immunity, even a 
low dose of phages resulted in successful viral 
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propagation that caused the eventual demise 
of the bacterial culture. However, for bacteria 
and phage populations encoding CRISPRs and 
anti-CRISPRs, respectively, phages managed 
to replicate only if the initial MOI was high. 

The two groups followed different strat- 
egies to determine why the initial ratio of 
phages to bacteria is important for the success 
of phage propagation. Borges and colleagues 
used a genetic approach to test whether the 
inactivation of bacterial defences was due to 
the presence of anti-CRISPR proteins from 
more than one phage in the same host cell. 
The authors engineered a phage that was 
incapable of replication but did express anti- 
CRISPR proteins. By mixing these engineered 
phages with wild-type ones, the authors 
showed that the presence of the engineered 
phages could enable a wild-type phage infec- 
tion to succeed even at a low MOI that would 
normally fail. 

Landsberger and colleagues, by contrast, 
created a mathematical model to investigate 
the observed MOI dependence of phage infec- 
tions. They could recapitulate their experimen- 
tal findings only by using a model simulation 
in which bacteria enter an immunosuppressed 
state after surviving an initial infection by a 
phage that expressed anti-CRISPR proteins. 

Both groups’ results suggest that the first 
wave of phages that attempt infection succumb 
to CRISPR-Cas defences but manage to deliver 
anti-CRISPR proteins that immunosuppress 
the bacterial cell. This initial attack paves the 
way for a second wave of phages to successfully 
infect the now-defenceless bacterium. Having 
a high MOI increases the probability that a 
second wave of phage infection will occur. 

The authors of these two studies validated 
this hypothesis in different ways. Landsberger 
and colleagues infected a bacterial population 
with phages encoding anti-CRISPR proteins 
and then introduced a type of circular DNA 
called a plasmid into the bacteria. Plasmid 
DNA would normally be targeted by the bac- 
terial immune system, but the earlier phage 
infection limited this targeting of plasmid 
DNA, demonstrating that the bacterial cells 
were in an immunosuppressed state. Borges 
et al. demonstrated that phage-mediated 
immune suppression also occurs for another 
type of CRISPR-Cas system, called type I, 
suggesting that phage cooperation to enable 
immunosuppression of bacteria is a general 
principle that extends beyond a single type of 
CRISPR-Cas system. 

The studies by Borges, Landsberger and 
their colleagues provide fresh insights into 
viral group dynamics, and join a growing body 
of evidence indicating that viruses can benefit 
from the group behaviour of a viral popula- 
tion. For example, phages can coordinate their 
infection dynamics using communication by 
small molecules to determine whether an indi- 
vidual phage will replicate in an infected cell 
or enter a dormant state termed lysogeny”™. 
Perhaps phages also cooperate to overcome 
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components of the bacterial immune system 
other than CRISPR-Cas, such as restriction 
enzymes or other antiviral defence systems’. 
Many questions remain to be answered 
concerning the bacterial immunosuppres- 
sion generated by anti-CRISPR proteins. How 
long will a bacterium remain in this defence- 
less state? How does this state vary from cell 
to cell in a bacterial population? Although 
this cooperation strategy seems to provide 
one way for phages to neutralize CRISPR-Cas 
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complexes, perhaps other strategies remain to 
be discovered that do not require the sacrifice 
of the first wave of infecting phages. m 
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Histidine degradation 


boosts cancer therapy 


Clinical use of the anticancer drug methotrexate can be limited by its high toxicity. 
It emerges that a diet rich in the amino acid histidine increases the effectiveness 
of methotrexate treatment and lowers toxicity in mice. SEE LETTER P.632 


CHRISTIAN FREZZA 


ethotrexate was one of the first 
Merve anticancer drugs, and is 

a cornerstone of modern chemo- 
therapy for the treatment of certain solid 
tumours and blood cancers. However, this 
therapy must often be stopped prematurely 
because of its high toxicity in patients’ healthy 
cells. On page 632, Kanarek et al.' report 


that degradation of the amino acid histidine 
can increase the sensitivity of cancer cells to 
methotrexate, and the results suggest that this 
approach can also help to lower the side effects 
of methotrexate treatment in mouse models. 
An enzyme cofactor termed folate is 
required for DNA and protein synthesis. In 
1948, it was found’ that leukaemia is par- 
ticularly sensitive to drugs that block folate 
synthesis. Methotrexate inhibits the enzyme 
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dihydrofolate reductase (DHFR), which 
synthesizes the active form of folate known 
as tetrahydrofolate (THF) from its precursor 
dihydrofolate (DHF) (Fig. 1). However, when 
this drug is used in the clinic, it can substan- 
tially reduce the level of folate in healthy cells 
—a side effect that is only partially relieved 
by taking folate supplements. If this toxicity 
reaches a level that compromises the func- 
tion of healthy cells, therapy may have to be 
stopped prematurely. There is therefore great 
interest in trying to reduce the side effects of 
methotrexate treatment. 

To investigate the processes that affect the 
cellular response to methotrexate, Kanarek 
et al. used a cell line derived from a person 
with leukaemia. They employed a gene-editing 
technique that prevents the expression of indi- 
vidual genes throughout the entire genome in 
such cells, and tested whether a decrease in 
the expression of any particular gene reduced 
the cells’ sensitivity to methotrexate. One 
such gene that they identified encodes a pro- 
tein called SLC19A1, which transports folate 
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Figure 1 | The effectiveness of the cancer drug methotrexate can be 
improved by histidine degradation. a, The cofactor tetrahydrofolate 
(THE) is required for the function of enzymes, including those that catalyse 
processes needed for cell proliferation. The enzyme dihydrofolate reductase 
(DHFR) generates THF from dihydrofolate (DHF), and this catalytic 
reaction can be inhibited by the anticancer drug methotrexate. Kanarek 

et al.' report that, in mouse models and human cell lines, the sensitivity of 
cancer cells to methotrexate can be boosted by the metabolic degradation 
of the amino acid histidine. Histidine is metabolized to the amino acid 
glutamate through a series of intermediate molecules (black circles) in a 
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pathway that requires the enzyme histidine ammonia lyase (HAL), and, 
crucially for this effect on methotrexate treatment, the THF-dependent 
enzyme formimidoyltransferase cyclodeaminase (FTCD). b, In cancer cells, 
a large pool of THF is available for THF-dependent enzymes, such as FTCD 
or those needed for cell proliferation. c, In methotrexate-treated cancer 
cells, a decrease in THF levels reduces the activity of these enzymes and 
thereby reduces tumour growth. d, The authors report that tumour growth 
in mice given methotrexate and a histidine-rich diet was substantially 
decreased compared with c. This might be because histidine degradation co- 
opts THF for use with FTCD, thereby enhancing the effect of methotrexate. 
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and methotrexate into cells. This result was 
unsurprising, given that SLC19A1 has previ- 
ously been linked’ to sensitivity to methotrex- 
ate, but it confirmed the power of the authors’ 
approach. 

Kanarek and colleagues found that 
decreasing the expression of a gene that 
encodes the enzyme formimidoyltransferase 
cyclodeaminase (FTCD) also reduced the 
sensitivity of the cancer cells to methotrexate. 
FTCD degrades histidine, an amino acid that 
humans must obtain from their diet because 
the body cannot synthesize it. The authors rea- 
soned that, because FTCD requires THF as a 
cofactor for its activity, a decrease in the intra- 
cellular levels of FTCD could help to limit the 
reduction of cellular pools of THF that occurs 
with methotrexate treatment. Consistent with 
this hypothesis, the authors found that when 
the levels of FTCD were reduced by gene edit- 
ing, THF depletion in human cancer cells 
treated with methotrexate was less than that 
seen in methotrexate-treated cells in which 
FTCD levels were not reduced. 

On the basis of these findings, the authors 
predicted that the manipulation of other steps 
in the histidine-degradation pathway could 
affect the efficacy of methotrexate treatment. 
To test this, they again used gene editing to 
decrease the expression of other genes in this 
pathway in human cancer cells grown in vitro. 
They found that such decreases did indeed 
mimic the effect of decreased FTCD levels in 
preserving the cellular pool of THF in metho- 
trexate-treated cells. 

The authors then analysed a panel of 
cancer cell lines, and observed a correlation 
between cell lines that had low expression of 
the enzyme histidine ammonia lyase (HAL), 
which acts at the first step in the histidine- 
degradation pathway, and low sensitivity to 
the effects of methotrexate. Moreover, when 
Kanarek and colleagues analysed data’ available 
for people who have a form of leukaemia that 
is commonly treated with methotrexate, they 
found that patients who had high levels of HAL 
expression had higher survival rates than those 
with low HAL expression levels. Perhaps this is 
because methotrexate is more effective when 
high levels of HAL boost histidine degradation 
and the depletion of THF pools by FTCD. 

The authors therefore proposed that greater 
cellular histidine degradation should deplete 
the THF pool and thus increase the effec- 
tiveness of methotrexate. To explore this 
prediction, they generated tumours in mice, 
and gave the animals methotrexate and a his- 
tidine-rich diet. They treated the animals with 
lower doses of methotrexate than those nor- 
mally used in such models, to assess whether 
histidine had a synergistic effect in this con- 
text. The histidine-rich diet did indeed boost 
tumour-growth inhibition in response to low- 
dose methotrexate compared with the effects 
of methotrexate in animals that did not receive 
a dietary histidine boost. 

An unexpected and exciting finding of 


Kanarek and colleagues’ work is that, in their 
mouse models, the toxicity that is usually seen 
in non-cancerous tissues with methotrexate 
treatment did not occur when the animals 
were given a histidine-rich diet, despite the 
extra toxicity that would have been expected 
as a result of the depletion of THF in these 
tissues. The results not only indicate that 
histidine can increase methotrexate-driven 
cytotoxicity in cancer cells, but also raise the 
possibility that some mechanism is reducing 
the toxic effects of THF depletion in normal 
tissues. Do normal and tumour tissue differ in 
sensitivity in this context as a result of having 
different thresholds for toxicity, or is the rate of 
histidine degradation different in normal and 
tumour tissues? 

Given the technical challenges involved in 
measuring certain intermediate molecules 
in the histidine-degradation pathway, histi- 
dine’s contribution to the folate pool still needs 
to be quantified in vitro and in vivo to validate 
the authors model. Furthermore, there are 
other pathways” , in addition to that of histi- 
dine degradation, for example degradation 
of the amino acid methionine, that can affect 
the THF level in cells. It will be important to 
investigate the extent to which such pathways 
contribute to the THF pool, and whether they 
also affect methotrexate-associated toxicity. 

Kanarek and colleagues’ work indicates the 
potential value of conducting clinical trials to 
investigate whether histidine supplementation 
can boost the effectiveness of methotrexate in 
cancer treatment. If this is the case, further 
clinical trials would be needed to determine 
the optimal dosage. Histidine supplementation 
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might provide an inexpensive and safe strategy 
for increasing the efficacy of methotrexate. 
Supplementation with low doses does not have 
adverse effects in humans, although doses of 
more than 8 grams of histidine a day can lead 
to zinc deficiency®. However, there have been 
no extensive studies to investigate possible 
detrimental effects of a histidine-rich diet in 
humans. 

Finally, these results might have relevance 
for people with rheumatoid arthritis or other 
chronic autoimmune diseases, in which low- 
dose methotrexate is the standard treatment’ 
and folate supplements are the only strategy 
currently available to limit side effects. Could 
a histidine-rich diet reduce side effects in these 
people also, and increase their well-being? = 
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How the head governs 
the heart of fly species 


The males of two closely related species of fly respond differently to a female 
pheromone. It emerges that this difference is due to alterations in the activity of 
an evolutionarily conserved neural circuit in the brain. SEE ARTICLE P.564 


NICOLAS GOMPEL 
& BENJAMIN PRUD’HOMME 


articular sensory signals — a certain 
Pp odour or taste, for instance — can trigger 

opposing responses in different species. 
What changes in neural circuits could elicit 
such contradictory behaviours? On page 564, 
Seeholzer et al.' compare the circuits that con- 
trol courtship in males in two closely related 
species of fruit fly that behave differently when 
exposed to a particular pheromone. They find 
an explanation for these opposing behaviours 
that few neuroscientists would have predicted. 


© 2018 Springer Nature Limited. All rights reserved. 


While wandering on a rotten apple, a male 
fruit fly is likely to encounter a variety of other 
flies, only a few of which will be females of the 
same species. Identifying and courting these 
conspecific females is a crucial step in securing 
the male fly’s reproductive success. But this is 
not always an easy task. For instance, two sister 
species of fruit fly — Drosophila melanogaster 
and Drosophila simulans — are often found on 
the same fruits. Males of these species there- 
fore rely heavily on sensing pheromones to 
identify conspecific females”. 

To test whether a female fly is of the same 
species, the male touches her abdomen with 
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Figure 1 | Changes in the activity of a courtship-promoting neural circuit. a, Female fruit flies of 

the species Drosophila melanogaster produce the pheromone 7,11-HD. Seeholzer et al.' report that 

D. melanogaster males identify females of their species through neurons in the foreleg that express 

the protein Ppk23. These ppk23 neurons activate a population of neurons called vAB3 that, in turn, 
activates neurons in the P1 brain region, promoting courtship behaviour. vAB3 neurons also activate the 
mAL neuron population, which inhibits P1. In this species, vAB3 activation overrides mAL inhibition 

to trigger P1 activity, leading to courtship. b, The authors find that this neural circuit is evolutionarily 
conserved in males of a sister species, Drosophila simulans. However, mAL-mediated inhibition overrides 
vAB3 activation, so that P1 is inhibited and courtship behaviours are aborted. Whether this difference 
reflects changes in the activity of mAL neurons or changes in how the inputs are detected by P1 neurons 


remains unclear. 


his foreleg, sampling her pheromones using 
sensory neurons. D. melanogaster females 
produce the pheromone 7,11-HD, which pro- 
motes courtship behaviour when sampled by 
D. melanogaster males’. By contrast, D. simu- 
lans females do not produce 7,11-HD, and 
D. simulans males react to the pheromone (on 
either D. melanogaster females or D. simulans 
females doused with 7,11-HD) by abort- 
ing courtship behaviour. Hence, 7,11-HD is 
detected by males from both species, but elic- 
its contrary behavioural responses in each. 

What differences in the nervous systems of 
the male flies could account for the opposing 
role of 7,11-HD in regulating courtship deci- 
sions in these two species? Seeholzer et al. set 
out to answer this question by comparing the 
architecture and function of the neuronal cir- 
cuits that govern courtship behaviour in both. 
To achieve this, the authors used a combina- 
tion of sophisticated techniques: the genetic 
manipulation of specific populations of neu- 
rons; the activation of particular neurons with 
light using a technique called optogenetics; 
and the observation of neuronal activity in 
real time using functional imaging. 

In D. melanogaster males, the decision to 
initiate courtship is made by the activation 
of a region in the brain called P1, which con- 
tains neurons that integrate multiple sensory 
signals from outside the brain’. Seeholzer et al. 
found that P1 has the same role in D. simulans. 
Moreover, the neural pathway that detects 


486 | NATURE | VOL 559 | 26 JULY 2018 


7,11-HD and propagates the sensory input to 
P1 is structurally similar in D. melanogaster 
and D. simulans (Fig. 1). 

The authors showed that 7,11-HD is 
detected by a few taste-perceiving neurons 
in the male foreleg of each species that are 
characterized by expression of the protein 
Ppk23. These neurons activate a population 
of neurons called vAB3 that, in turn, directly 
excites P1 neurons’. However, the researchers 

demonstrated that 


“The authors vAB3 neurons also 
have shown activate a population 
clearly that of neurons called 
evolutionary mAL that directly 
changes can be inhibits P1 neurons’. 
embeddedmure. °°" PE*tomone- 
deeplyinbrain Ferre D athway 

therefore bifurcates 


ee 
circuits. into excitatory and 


inhibitory routes 
that converge on P1. This circuit presumably 
provides a mechanism for setting pheromone 
responses and for tightly controlling the 
excitation of P1. 

If the structure of the pheromone- 
processing pathway is evolutionarily con- 
served between species, how does 7,11-HD 
elicit a net excitation of P1 to trigger courtship 
in D. melanogaster, but not in D. simulans? 
Seeholzer et al. found that it’s simply a matter 
of balance. The mAL-mediated inhibition of 
P1 neurons completely masks their excitation 
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by vAB3 neurons in D. simulans, resulting in 
a net inhibition of P1. But mAL activity only 
dampens the activation of P1 by vAB3 in 
D. melanogaster. 

Seeholzer and colleagues’ work there- 
fore suggests that it is not the architecture 
of the circuitry that has changed during the 
evolution of the two species, but the rela- 
tive weight of the inhibitory and excitatory 
pathways that converge on P1 neurons. This 
is unexpected, because the idea that changes 
outside the brain can explain behavioural 
diversity has long dominated this field*"”. But 
the authors have shown clearly that evolution- 
ary changes can be embedded more deeply 
in brain circuits, where no one has looked 
so far. This is a provocative new conceptual 
framework that explains how a behaviour can 
change between closely related species. 

The most interesting aspect of Seeholzer 
and colleague's work is undoubtedly the unex- 
pected nature of the findings. But the study’s 
combination of state-of-the art techniques 
also makes it a technical triumph, for two rea- 
sons. First, using neurogenetic tools designed 
for the model organism D. melanogaster in the 
non-model species D. simulans was a chal- 
lenge. Second, comparing the structure and 
function of individual circuit components 
between species, in the haystack of neurons 
that makes up the brain, is uncharted territory. 

Of course, several questions remain. For 
instance, what caused the change in bal- 
ance between the inhibitory and excitatory 
branches of the courtship pathway? Perhaps 
it was a progressive change that emerged 
from genetic variation between individuals 
belonging to the last common ancestor of 
these species. Under sexual selection, initial 
variations that caused two groups of males to 
respond differently to 7,11-HD, as well as two 
groups of females to produce different levels 
of the pheromone, might have gradually built 
a barrier between the incipient species. The 
genetic basis of the difference in neural-circuit 
activity between the two species, however, 
remains to be defined. 

It is also unclear how fine-tuning of 
the circuit differs between the species. 
Seeholzer et al. showed that vAB3 and mAL 
neurons are active in both species, so there 
must bea difference in the way their signalling 
is propagated to P1. Perhaps there are changes 
in the levels of signalling by mAL neurons, 
or maybe P1 neurons respond differently to 
inhibitory inputs in each species. 

Finally, the study highlights that integrative 
nodes in neural circuits such as Pl might 
be poised to accommodate evolutionary 
changes that underlie variations in behav- 
iour. This concept is analogous to the evo- 
lution of particular organismal shapes. In 
such a setting, particular genes at key nodes 
in gene-regulatory networks are poised to 
accommodate changes that lead to the gen- 
eration of forms". Future work, following the 
approach pioneered by Seeholzer et al., will 


reveal the extent to which the evolution of 
behaviour is targeted to particular nodes in 
neural circuits. = 
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Ancient ice sheet had 
a growth spurt 


An analysis of ancient coral from the Great Barrier Reef reveals that global sea 
level fell rapidly at the end of the last glacial period. The findings suggest that ice 
sheets are more dynamic than was previously thought. SEE LETTER P.603 


PIPPA WHITEHOUSE 


r | the episode known as the Last Glacial 
Maximum extended from 26,500 to 
19,000 years ago, and is often thought of 

as a prolonged time during which glaciation was 

continuously at its greatest extent’. But on page 

603, Yokoyama et al.’ report an analysis of relict 

coral reefs that suggests that the already exten- 

sive ice sheets underwent a final burst of rapid 
growth about 22,000 years ago, at a rate sufficient 
to lower global mean sea level by an astounding 

17 metres in only 500 years — about ten times 

faster than the current rate of sea-level rise. This 

seems to have been followed almost immediately 
by ice-sheet retreat. Understanding the trigger 
for this previously undocumented growth is cru- 
cial to understanding the sensitivity ofice sheets 
to external drivers and internal feedbacks. 
Previous studies have tended to focus on the 
magnitude of sea-level decline during the Last 
Glacial Maximum (LGM), rather than the tim- 
ing of it, with the general consensus being that 
sea level was about 125-135 m lower than it is 
today**. Geological records that shed light on 
the detailed timing of sea-level change during 
this period are notoriously difficult to obtain, 
not least because they are often located about 

100 m underwater. To address this problem, 

Yokoyama et al. analysed a ‘staircase’ of relict 

coral reefs that once formed part of the Great 

Barrier Reef at the edge of the Australian 

continent — analysis of the recovered corals 

indicates the sea level at the time they were 
alive, whereas the reef age was determined 
using radiometric dating techniques. 


One issue with such studies is that modern 
corals live at a range of depths, creating con- 
siderable uncertainty about the depths that can 
be inferred from fossilized samples’. However, 
by piecing together information from adja- 
cent reefs at different depths, Yokoyama et al. 
built up a composite picture of the timing and 
magnitude of LGM sea-level decline off the 
coast of Australia. Their approach is clever 
and simple: they reason that a hiatus in coral 
growth caused by sea-level fall at one site must 
coincide with the development of a second, 
nearby reef farther offshore, as water depths 
became shallow enough to support growth 
at the deeper site (Fig. 1). Moreover, samples 
recovered from the second site indicate that 
the sea-level minimum (the lowstand) was 
short-lived, and that sea level soon began to 
rise again. 

To identify the ice sheets responsible for 
the dramatic sea-level fall, Yokoyama et al. 
turned to a technique known as glacial iso- 
static adjustment (GIA) modelling, which 
can predict how ice-sheet change translates 
into sea-level change®. Although increases in 
ice mass directly equate to decreases in ocean 
mass, the resulting pattern of sea-level change 
is not straightforward. This is because the land 
beneath ice sheets subsides as mantle material 
is displaced by the weight of the accumulating 
ice, and rebounds when the ice melts; an equiv- 
alent process takes place across ocean basins 
in response to increases or decreases in ocean 
mass. The changes in the shape of the solid 
Earth alter the shape of the planet's gravity 
field. And because the gravity field defines the 
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50 Years Ago 


Compensation of £3,200 has so 

far been paid by the Ministry 

of Technology for the damage 
caused by the four sonic boom 

tests carried out over London last 
summer with Lightning aircraft. 
The other seven tests over other 
parts of Britain cost just over £700 
in compensation ... With one 
exception, all the claims involved 
damage to property. The exception 
was a claim made by a woman from 
Hornchurch who suffered a partial 
loss of hearing ... and accepted £150 
settlement. Even before the tests, 
the British Government seems to 
have recognized that aircraft should 
not be allowed to fly at supersonic 
speeds over densely populated 
areas ... It seems inevitable that all 
supersonic flight over land will be 
prohibited, at least until a great deal 
more information on the effect of 
sonic booms has been collected. 
From Nature 27 July 1968 


100 Years Ago 


Anarticle on coal-saving ... appears 
in Engineering for July 12. The 
author ... gives average figures for 
250 typical steam-boiler plants, 
covering ... 1910 to the present 
time. It is estimated that 58,500,000 
tons of coal per annum are used 

in this country for steam-raising 
purposes ... exclusive of 15,000,000 
tons used in railways. The 250 plants 
had a total of 1000 boilers, principally 
of the Lancashire type. With hand- 
firing the average net working 
efficiency is 57.8 per cent., as against 
mechanical firing with an average 
net working efficiency of 61.4 per 
cent. ... The author estimates that 
there are 45,000 to 60,000 steam 
boilers at work in Great Britain, 
calculated in terms of averaged-sized 
Lancashire boilers, and considers 
that all the steam produced in the 
country to-day could be obtained 
much more economically with 25 per 
cent. fewer boilers. 

From Nature 25 July 1918 
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a_ LGM, before 22,000 years ago 


Coral reef 


b 22,000-21,500 years ago 


c Present day 


Ocean loading 


Figure 1 | Rapid changes of sea level recorded by coral reefs. Yokoyama et al.” studied fossil corals at 
the Great Barrier Reef to work out how sea level changed during the Last Glacial Maximum (LGM; 26,500 
to 19,000 years ago' ). a, Before 22,000 years ago, sea level was about 110 metres below the present level 
and the reef was confined to a region close to the edge of the continental shelf. b, Between 22,000 and 
21,500 years ago, a sea-level drop of approximately 20 m altered the position at which coral could grow. 
The original reef died as it became exposed to air, and a new reef began to grow farther offshore. c, The 
sea surface then rose to present-day levels as ice sheets (not shown) melted. As the ocean flooded the 
continental shelf, the extra load of water caused the land to tilt (dotted line indicates original position of 


the continental shelf). 


shape of the sea surface, the resulting pattern 
of sea-level change reflects mass redistribution 
throughout the Earth system, as well as the net 
change in ocean mass. 

Using a priori assumptions about ice-sheet 
changes during the last glacial cycle, GIA mod- 
els can predict the global pattern of sea-level 
change over time. In places such as Australia, 
which are far from major ice sheets, sea-level 
change will be close to the global mean — but 
even so, a correction must still be applied to 
account for the tilting of the continental shelf 
that occurs as it is flooded during post-glacial 
sea-level rise’. By testing different ice-sheet 
scenarios in a GIA model, and drawing on 
evidence from other locations that record the 
sea-level lowstand, Yokoyama et al. tentatively 
propose that growth of the North American ice 
sheet was the main cause of the rapid sea-level 
drop at the end of the LGM. 

Techniques for reconstructing ice-sheet 
changes from sea-level observations rely on the 
fact that sea-level change is not uniform across 
the planet, but depends on proximity to the 
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changing ice sheets. However, the distribution 
of currently available sea-level records is insuf- 
ficient to allow us to determine the precise 
pattern of ice-sheet change during the LGM, 
making it difficult to pinpoint the cause of the 
sudden ice expansion and the reason for its 
brevity. Additional insight could be gained by 
studying sites at which sea-level change will be 
sensitive to changes in nearby ice sheets. 

But the interpretation of such ‘intermediate 
field records would be complex. Most GIA 
models assume that Earth’s properties do not 
vary from place to place, but analyses of the 
speed at which seismic waves travel show that 
this is an over-simplification (see ref. 8, for 
example). Future sea-level-based inferences of 
ice-sheet change should use GIA models that 
incorporate 3D structural variation in their 
representation of the solid Earth. 

Direct evidence of past ice-sheet growth, 
rather than indirect evidence from ancient 
shorelines, would also provide crucial insight 
into the cause of the final LGM sea-level drop 
reported by Yokoyama and colleagues. Such 
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evidence is, however, extremely difficult to 
piece together, because growing ice sheets 
tend to erase evidence of their past extent 
from the landscape. 

Unlike ice-sheet retreat, the mechanisms 
responsible for rapid ice-sheet growth have 
received relatively little attention. It is tempting 
to equate cold periods in Earth’s history to an 
increase in snowfall, but the opposite is actu- 
ally true, because cold air holds less moisture 
than warmer air’. Yokoyama et al. suggest that 
an increase in the amount of solar radiation 
received by the Southern Hemisphere might 
have increased Antarctic snowfall towards the 
end of the LGM, but increased snowfall is not 
documented in ice-core records for this period 
(see ref. 10, for example). 

An alternative explanation of Yokoyama 
and colleagues’ findings is that cold ocean 
temperatures might have led to the expan- 
sion and thickening of floating ice adjacent 
to marine-terminating portions of the major 
ice sheets. Such buttressing ice shelves have 
a key role in reducing the flux of ice into the 
ocean”, and their presence can promote ice- 
sheet growth. But reconstructing the extent of 
ice shelves to investigate this idea will be chal- 
lenging. Moreover, there is much uncertainty 
about how ocean temperature changed during 
the LGM. 

Rather than increasing in thickness, did 
one or more of the ice sheets increase in extent 
towards the end of the LGM? Lower sea lev- 
els and exposed continental shelves would 
have made it easier for ice sheets to advance 
offshore at that time, but growing ice sheets 
depress the land beneath, counteracting the 
effect of the lower sea level and potentially 
leading to ice-sheet instability. Could these 
feedbacks between ice dynamics, the underly- 
ing Earth and the adjacent ocean explain the 
short-lived nature of the final LGM sea-level 
lowstand, or did external climate factors bring 
it to an abrupt end? Whatever the explanation, 
it seems the LGM was more dynamic than 
previously thought. = 
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he tropics are home to a dizzying array of species, 

store vast quantities of carbon, drive year-to-year 

climate variability, and will soon be home to half 
of the world’s population. In this Insight, we explore 
some of the scientific and societal challenges posed by 
the tropics, and some of the solutions being devised. 

Mosquito-borne diseases, such as dengue and malaria, 
are a major cause of morbidity in the tropics. Neil 
Ferguson examines how new interventions offer hope 
of reducing disease burden, but only if supported by 
rigorous trials and epidemiological analyses. 

Ian Gilbert and colleagues explore the challenges 
associated with developing new drugs to tackle infectious 
tropical diseases such as malaria and tuberculosis. Recent 
progress shows that headway can be made with the 
concerted efforts of all stakeholders. 

Non-communicable diseases, such as heart disease and 
cancer, are often considered a problem of the wealthy. 
Majid Ezzati and colleagues show that these diseases 
cause more deaths in low- and middle-income tropical 
countries than in high-income Western countries. 

The tropics harbour more than three-quarters 
of all species. Jos Barlow and colleagues review the 
vulnerability of this hyperdiversity to mounting local 
and global stressors. Only through coordinated local, 
national and international action cana collapse of 
tropical biodiversity be prevented, they argue. 

Ina Review of the tropical forest carbon budget, 
Edward Mitchard suggests that tropical forests are 
probably in the process of switching from making an 
approximately neutral contribution to the global carbon 
cycle to becoming a net carbon source. 

The tropics also set the stage for the El Nifio-Southern 
Oscillation system. Axel Timmermann and colleagues 
synthesize the complexity of this system, and argue that 
its variability is excited by a wide range of mechanisms, 
many of which depend on climatic conditions. 

The tropics are undergoing substantial social, 
environmental and climatic change. Whether an 
equitable, healthy and sustainable future can be forged, 
which fosters the region’s rich ecological inheritance and 
the environmental benefits it bestows, will depend on 
actions taken in the next few years. 
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Clare Thomas, Christina Tobin Karlstrém, 
Michael White 
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Challenges and opportunities in 
controlling mosquito-borne infections 


Neil M. Ferguson!* 


Mosquito-borne diseases remain a major cause of morbidity and mortality across the tropical regions. Despite much 
progress in the control of malaria, malaria-associated morbidity remains high, whereas arboviruses—most notably 
dengue—are responsible for a rising burden of disease, even in middle-income countries that have almost completely 
eliminated malaria. Here I discuss how new interventions offer the promise of considerable future reductions in disease 
burden. However, I emphasize that intervention programmes need to be underpinned by rigorous trials and quantitative 
epidemiological analyses. Such analyses suggest that the long-term goal of elimination is more feasible for dengue than 
for malaria, even if malaria elimination would offer greater overall health benefit to the public. 


eaths from malaria have almost halved since 2000!°, despite 

rapidly growing populations in many endemic regions. 

Improvements in vector control and (to a lesser extent) treatment 
are the principal drivers behind this decline*°, however, development— 
leading to improved housing and increased urbanization across the 
tropics—has also been an important contributing factor®. The declines 
have been particularly marked outside sub-Saharan Africa, with only 
10% of malaria deaths now occurring outside this region’. Over the 
same period, the burden of disease from dengue has risen markedly, 
often in those countries that have the largest reductions in malaria dis- 
ease®® (Fig. 1). Part of the increase in burden reflects population growth 
and urbanization in the tropics, but greater connectivity of human 
populations!” (which leads to all four dengue serotypes now regularly 
being detected in all endemic countries"), entomology (Aedes aegypti, the 
principal dengue vector, is more highly adapted to urban environments’) 
and climate change (increasing the geographical limits of endemic 
transmission!) have also played a part. 

The number of malaria deaths globally remains over 30-fold higher 
than those from dengue (Fig. 1), although this ratio is less extreme if one 
only considers disease burden outside Africa. Furthermore, given the 
availability of highly effective artemisinin-based malaria treatments, a 
large proportion of remaining mortality induced by malaria reflects gaps 
in access to treatment or suboptimal diagnosis*°. Conversely, dengue 
affects urban populations with better (although often not perfect) access 
to healthcare. No effective dengue antiviral drugs or monoclonal antibod- 
ies are currently available for dengue treatment, although improvements 
in case management have led to substantial reductions in case fatality 
ratios in situations in which the current best standard of care is availa- 
ble!+, The unpredictability and explosive nature of dengue epidemics also 
imposes substantial stresses on healthcare systems, and can cause a lot 
of anxiety for the public, particularly in contexts in which the burden of 
disease from most other infections has been markedly reduced in recent 
decades through vaccination and access to treatment. 

Emerging arboviral infections have also caused substantial public con- 
cern in recent years’°. Chikungunya emerged in Latin America and the 
Caribbean in 2013!” followed by Zika in 2015'®. Both caused large-scale 
epidemics over an approximately two-year period, before the accumu- 
lation of population immunity in populations across the region led to 
marked decreases in incidence'?-*!. Despite availability of a highly effec- 
tive vaccine, yellow fever has also caused relatively large, unpredicted 


epidemics in Angola and Brazil in the last two years, necessitating rapid 
large-scale immunization campaigns”. However, the true burden of dis- 
ease caused by both chikungunya and Zika is highly uncertain, in large 
part because of a lack of systematic surveillance across much of the world 
and, for Zika, the very mildly symptomatic nature of most infections. The 
health consequences of infection with both viruses remain poorly charac- 
terized and it is unclear to what extent either virus can be considered truly 
endemic in the human population globally, or whether these viruses, like 
yellow fever, are dependent on a sylvatic cycle of transmission. These gaps 
in our knowledge!® make assessing the public health need for effective 
interventions highly challenging. 

The perceived increasing threat (and disease burden) posed by arboviral 
infections and the recent global emphasis on reducing the disease burden 
of malaria has led to increasing investment in the development of new 
interventions and the intensification of current vector control in many 
endemic low- and middle-income countries (LMICs). The first vaccines 
for both malaria and dengue offer some promise for disease reduction 
but their imperfect and complex efficacy profiles mean that neither rep- 
resents a panacea and their uptake has therefore been slow”*-?”. However, 
the accelerated roll-out of long-lived insecticide-treated nets (LLINs) 
across sub-Saharan Africa over the last decade has led to major declines 
in malaria incidence, with improved treatment further reducing disease 
burden‘. This is not to imply that these gains can be taken for granted— 
insecticide resistance** and/or failure to sustain intervention coverage pose 
risks of rapid bounce-back. However, by contrast, the evidence that current 
vector control measures for dengue are having significant impact is limited 
at best->*, Although differences in the ecology of the principal vectors 
(for example, in landing periodicity**) for the two classes of infections 
clearly affect the effectiveness of different interventions, I propose that 
the failure of dengue control to date is principally a consequence of intrin- 
sic differences in the epidemiology of arboviral and malaria infections. 
These differences necessitate fundamentally different goals for control 
policy planning for these two classes of infections. I will then review how 
new technologies for vector control that are currently under development 
offer the potential to deliver pronounced reductions in disease burden and 
potentially even elimination in the coming 10-20 years. 


Goals of vector control 
For all infections, the goals of control are the reduction of dis- 
ease incidence and, preferably, transmission. Elimination requires 
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Fig. 1 | Mortality trends between 2000 and 2016 for malaria and dengue. 
WHO estimates shown for malaria, combining estimates from the 2015 
and 2017 World Malaria Reports*’; other estimates!® are substantially 
higher but show the same overall trends. In the absence of WHO estimates, 
estimates of dengue-associated mortality from the 2016 Global Burden of 
Disease study are shown’. Top, malaria deaths in Africa. Bottom, dengue 
deaths and malaria deaths outside Africa. 


reducing transmission to levels that are less than self-sustaining: in the 
parlance of mathematical epidemiology, reducing the basic reproduc- 
tion number (the number of secondary infections caused by a typical 
index infection in a naive population), Ro, to below 1. Given the lack of 
vaccines with high efficacy for all mosquito-borne infections other than 
yellow fever and Japanese encephalitis, the main goal of public health 
interventions for these infections has been to reduce human expo- 
sure through vector control. The rationale for malaria is clear: in high 
transmission areas, a child might be infected multiple times in a year, 
with each new infection posing a risk of disease. Reducing exposure 
by a certain proportion then leads to an immediate and comparable 
reduction in the incidence of disease—a reduction that can be further 
improved by improving access to effective antimalarial therapy. Clinical 
immunity nuances this picture but does not fundamentally change the 
broad conclusion—although exposures at a later age have a lower risk 
of disease (severe disease particularly), this risk declines gradually, and 
both age and exposure have a role in this gradual decline™*. 

The same rationale does not hold true for arboviral infections. Unlike 
malaria (although controversy remains about the extent to which that 
pathogen can be viewed as a collection of semi-independent antigeni- 
cally diverse strains*»), all arboviruses are thought to generate neutral- 
izing homotypic immunity after infection; this means that individuals 
can experience only one infection with each virus in their lives. Dengue 
is composed of four distinct (but immunologically cross-reactive) viral 
serotypes, and therefore four infections are possible. However, second- 
ary dengue infections are responsible for the great majority of severe 
disease***°, with tertiary and quaternary infections thought to be 
largely asymptomatic”. The immunizing nature of arboviral infections 
has profound consequences for vector control. Completely preventing 
an individual from being exposed in one year has no impact on life- 
time disease risk if a high-level risk of exposure resumes the following 
year—the only effect of such transient interventions is to postpone 
infection. The same reasoning applies to partially effective controls. 
Consider a high transmission setting in which individuals have a 20% 
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Fig. 2 | Comparative effect of vector controls on incidence of clinical 
malaria and hospitalized dengue in high transmission intensity 
settings. a, Published mathematical transmission dynamic models of each 
disease”*” are used to show the long-term effect of varying (i) coverage 
levels of LLINs for malaria in a sub-Saharan African hyperendemic setting 
with an (assumed constant) entomological inoculation rate in the absence 
of controls of 500 (Ro of approximately 700); (ii) the proportion of dengue 
exposure blocked by a hypothetical dengue vector control measure (100% 
coverage assumed) in a setting in which seroprevalence in 9-year olds is 
80% on average before the introduction of controls (Ro of approximately 4). 
b, The temporal impact of controls on annual disease incidence. Controls 
are introduced in year 20 (50% LLIN coverage assumed for malaria, 

50% effective controls for dengue). Inter-annual climate variation is not 
included in either model, so malaria incidence is constant over time before 
interventions. Dengue incidence varies markedly year-to-year due to semi- 
chaotic serotype cycling. 


risk of dengue infection each year. On average, children will be 5-years 
old when they experience their first infection, and about 12-years old 
when they experience their second. Imagine a vector control interven- 
tion that reduces exposure by 50%—thus reducing the infection risk 
to 10% per year. The net long-term effect of this intervention is only to 
increase the age at which individuals experience their first and second 
infections—to 10 and 23 years, respectively, for this example. This may 
paradoxically increase overall disease burden, if dengue disease severity 
increases with age”). 

This argument can be formalized; Fig. 2a shows the relationship 
between disease risk and the effectiveness of vector controls at reducing 
exposure predicted by previously validated mathematical models of 
dengue and malaria transmission. This malaria model has been used 
extensively to inform control planning, and was validated against his- 
torical prevalence and incidence data both in the absence and presence 
of control measures’”-“*, The dengue model was previously fitted to 
the Sanofi-Pasteur dengue vaccine trial data and used to explore the 
potential impact of large-scale use of the Sanofi-Pasteur vaccine and 
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Wolbachia?***. For malaria, reductions in disease increase almost line- 
arly with coverage. For dengue, the response curve is highly nonlinear, 
with marginal reductions in lifetime disease risk until the level of expo- 
sure reduction is sufficient to reduce Ry close to 1 (around 70% effec- 
tiveness, for this example). I conclude that for vector controls against 
dengue (or other arboviruses) to have a major long-term impact, they 
must come close to stopping sustained transmission—that is, achieving 
elimination. Figure 2a also highlights that elimination, in theory, should 
require considerably less effort for dengue than for malaria—in high 
transmission settings, Ro for dengue is around 4, whereas for malaria 
it is over 100 (with a large degree of local geographical heterogeneity 
for both infections****’). Therefore, vector control for dengue needs 
to reduce exposure by 80% to achieve elimination, but for malaria the 
reduction required is over 99%. 

However, focusing only on the long-term effects of interventions 
neglects the transient temporal effects that the large-scale introduction 
of an intervention may have on transmission (simulated in Fig. 2b using 
the models used to generate Fig. 2a). Short- and long-term impacts 
differ less for malaria than dengue because of the more limited effect 
of host immunity in modulating disease transmission. The loss of clin- 
ical immunity largely explains the rebound in malaria in Fig. 2b. But 
herd-immunity is fundamental to dengue transmission dynamics; in 
endemic areas, at any point in time much of the population is immune 
to any one serotype, so when a new epidemic occurs, it only affects 
the minority of the population (typically children) who have not yet 
acquired immunity. Over time, a dynamical equilibrium is reached 
between viral transmission rates and the level of population immunity, 
which leads to the effective reproduction number (the average number 
of secondary infections caused by a typical index case in the presence 
of population immunity), R, to hover around 1. An exception to this 
is found in marginally endemic areas that have not yet been affected 
by all serotypes, where the initial epidemic following invasion with a 
novel serotype can be much larger than typical“; however, following 
such an initial epidemic, the resulting immunity also causes R for the 
invading serotype to fall below 1. 

In this context, suddenly introducing a new intervention population- 
wide disrupts this equilibrium, even if the reduction in transmission 
achieved is insufficient to cause long-term elimination. An intervention 
that achieves a sustained 20% reduction in exposure would initially 
reduce R from 1 to 0.8—leading to a temporary cessation of transmis- 
sion. New births into the population then gradually reduce population 
immunity, increasing R, and sustained transmission would resume 
once R once again reached 1. With an annual birth cohort size of, for 
example, 2% of the total population, it would take up to 10 years for 
R to increase from 0.8 back to 1. Stopping dengue epidemics for such a 
time clearly sounds like an impressive outcome, but it is important to 
note that it would not necessarily lead to a reduction in lifetime disease 
risk for individuals in the affected population unless more effective 
interventions could be introduced before the end of the 10-year interim 
period. 

Transient impacts on transmission also need to be taken into account 
when interpreting data from cluster-randomized trials of vector con- 
trol interventions. The effect size that is observed in trials of trans- 
mission-reduction interventions (such as community mobilization*’) 
over timescales of 1 to 2 years will implicitly include the large transient 
effect on incidence described above, and thus short-term measured 
effectiveness (for example, reduction in infection rates over one year) 
would be expected to be considerably greater than the underlying long- 
term effectiveness of the intervention (as quantified by the fractional 
reduction in the achieved Ro). 

Most of the above arguments around the goals and probable impact 
of interventions against dengue also hold for chikungunya and Zika, 
albeit the period between epidemics is much longer and more unpre- 
dictable than for dengue!®. There is one exception, that is, protecting 
vulnerable populations for a short period. Congenital Zika syndrome 
is predominantly associated with maternal exposure to Zika in the 
first trimester of pregnancy*”? ' Therefore, an intervention that targets 
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individuals (for example, spatial repellents) that reduces exposure for a 
three-month period might achieve a substantial reduction in the bur- 
den of disease caused by a Zika epidemic, even if that intervention has 
a minimal effect on community levels of transmission. However, just 
as vector controls that target community transmission require rigorous 
assessment in randomized trials, so do interventions that aim to reduce 
personal exposure. 

The high level of geographical, environmentally driven heterogeneity 
in the Ro of mosquito-borne infections provides nuances, but it does 
not invalidate these arguments. However, such heterogeneity—and 
the existence of hotspots with high transmission rates—increases the 
challenges involved in achieving elimination, albeit with the upside 
that control in areas with lower transmission rates may be easier 
than expected from geographically coarse estimates of transmission 
intensity. 


Assessing effectiveness of vector control measures 

Why then has there been so little recent success in controlling dengue 
(especially in LMICs), whereas major reductions in disease for malaria 
have been achieved? Singapore (which is not a LMIC) is perhaps 
the only formerly hyper-endemic country in recent decades to have 
achieved high levels of dengue control (although Cuba achieved similar 
success in the 1970s)°*. However, even Singapore has experienced 
increases in the incidence of dengue incidence in recent years asso- 
ciated with declining levels of population immunity>’. Nevertheless, 
substantial reductions in malaria transmission have been achieved in 
some of the poorest countries of the world in the last decade*®. 

Part of the answer lies in differences in vector ecology. Anopheles 
gambiae, the primary vector of Plasmodium falciparum in sub-Saharan 
Africa, principally bites at night (and therefore inside houses), which 
means that LLINs**°> (and indoor residual spraying (IRS)°®°’) are 
highly effective at reducing human exposure. A. aegypti bites during the 
day (with peaks of activity in the morning and afternoon) both inside 
and outside houses**. Furthermore, the mating patterns and breeding 
site preferences of A. aegypti are better adapted to urban landscapes”™ 
than most anopheline species (although there is evidence of urban 
adaptation of the latter®”). 

However, while the ecology of A. aegypti limits the effectiveness of 
‘simple’ interventions such as bednets”’, vector control for arboviruses 
has also suffered from a profound lack of rigorous evidence to sup- 
port the effectiveness of the individual measures that are currently in 
use”??? In contrast to malaria, for which large cluster-randomized 
trials with human infection endpoints generated the necessary evidence 
base to support the scale-up of LLIN and IRS use, randomized trials 
of vector control approaches for dengue are mostly absent, or have 
typically been underpowered and only measured entomological rather 
than epidemiological (that is, human disease) endpoints*. ‘Integrated 
vector management is the recommended approach for vector control 
of both malaria and dengue, defined as “a rational decision-making 
process for the optimal use of resources for vector control”®. Ironically, 
the evidence does not currently exist to make rational decisions for 
dengue vector control, caused by a lack of trial data that support the 
effectiveness of current control measures at reducing dengue disease. 

Most of the current activities to develop vector control strategies 
against dengue across the tropics are therefore driven more by the 
understandable hope that reducing mosquito numbers will only benefit 
public health, than by a quantitative evidence base that directs invest- 
ments in vector control towards intervention policies that will result in 
a substantial public health impact. Furthermore, any impact that such 
interventions might have is often decreased by a tendency of public 
health authorities to implement vector control measures in response to 
ongoing dengue epidemics, rather to sustain consistent control all year 
round. For the reasons outlined in the previous section, reactive short- 
lived interventions will at best protect individuals transiently, but will 
leave them susceptible to infection during the next epidemic. Again, 
Singapore is perhaps the exception—both in terms of the intensity of 
vector control activities (and the resources made available to sustain 
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these activities) and the mostly consistent, sustained nature of their 
implementation*’. 

In recent years, these shortcomings have been increasingly rec- 
ognized, culminating in the Vector Control Advisory Group of the 
WHO (World Health Organization) issuing new guidance that specifies 
that data from randomized trials with epidemiological endpoints will 
be required for WHO to recommend new interventions in future®!. 
Furthermore, a number of studies have provided improved guidance on 
appropriate trial design for dengue vector control trials*”®’, highlight- 
ing the need for trials to be sufficiently powered to allow for the high 
degree of spatiotemporal heterogeneity in the incidence of vector-borne 
diseases, to utilize clusters of sufficient size and spacing to minimize 
contamination or boundary effects, to have a sufficiently long period 
of follow-up and to measure epidemiological outcomes. Meeting these 
requirements is without a doubt challenging, and necessitates that trials 
are considerably larger (and consequently more expensive) than com- 
parable trials for malaria for which higher and more stable infection 
rates mean that trials can have a smaller scale. 

Indeed, only one dengue vector control intervention study—of 
the ‘Camino Verde’ approach to community mobilization*—has 
approached best practice in trial design. However, even in that 
case, extrapolation of the study results to predict likely long-term, 
large-scale effectiveness of the intervention is challenging. The 
small scale of individual clusters (140 households) means that the 
measured effectiveness may have been affected by individuals living 
in intervention areas being exposed to infection outside those areas. 
This ‘contamination’ issue could indicate that the 25-30% reduction 
in incidence that is seen in the trial underestimates the true effect 
size that would be seen if the intervention is implemented at larger 
scale. Conversely, for the reasons outlined in the previous section, the 
effect size measured in that trial (and all other short-term trials) over 
the approximately one-year period of follow-up would be expected 
to incorporate a short-term perturbative effect of the intervention 
on dengue transmission, and may therefore overestimate long-term 
effectiveness. 

Assessing the likely long-term impact of interventions from rela- 
tively short-term trial data are therefore challenging, not only to adjust 
for contamination and the typically short-term period of follow-up, 
but also because the transmission intensity (Ro) of all mosquito-borne 
infections show high levels of spatiotemporal variation***°’. Therefore, 
the effect size seen in one context may overestimate the effectiveness 
of the same intervention applied in a higher transmission intensity 
context, or underestimate the effects of the intervention in a lower 
transmission intensity setting. Repeating studies in a range of contexts 
can mitigate this issue, but this is costly. Mathematical modelling of 
each trial incorporating details of the transmission context, period of 
follow-up and cluster size is therefore required to derive estimates of 
effectiveness that can be used to predict the likely long-term impact 
of large-scale use of novel interventions across a variety of transmis- 
sion contexts. Such analyses are not straightforward (and are not a 
substitute for long-term follow-up data on effectiveness), but have 
been undertaken for LLINs and IRS for malaria**“4, and for both the 
CYD-TDV (also known by the trade name Dengvaxia, developed by 
Sanofi—Pasteur) dengue”*”* and RTS,S (also known by the trade name 
Mosquirix, developed by GlaxoSmithKline)?°°® malaria vaccines, and 
have been planned for the analysis of trials of Wolbachia as a dengue 
control measure*”®. 

Intervention trials for Zika and chikungunya—whether of vector 
controls, vaccines or treatment—are even more challenging than for 
dengue®, because of the longer inter-epidemic period and the highly 
unpredictable nature of epidemic timing. The traditional trial design 
of recruiting cohorts of participants, randomizing to intervention or 
control and monitoring outcomes is therefore likely to be prohibitively 
expensive—owing to the very large numbers of participants and sites 
needing to be included. Although model-based analysis of available 
surveillance data can improve the efficiency of site selection’, reactive 
designs may be more efficient long-term®. Such a trial might involve 
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gaining ethical and regulatory approval to proceed in advance for a 
large number of potential trial sites, but only triggering recruitment of 
participants and intervention implementation at a site once transmis- 
sion is detected there. That said, given their shared vectors and similar 
transmissibility, vector control interventions that show high efficacy 
against dengue are likely to also show comparable efficacy against Zika 
and chikungunya. 


The promise of new interventions 

Although the large-scale roll-out of LLINs (together with targeted use of 
IRS) across sub-Saharan Africa in the last decade has had a major effect 
on both malaria transmission and disease, even very high coverage 
levels of these interventions and effective treatment are predicted to 
be insufficient to eliminate malaria from the highest transmission 
settings’”“*, Insecticide resistance also poses a growing threat”®. Mass 
drug administration can have a very large short-term effect on malaria 
prevalence® (by reducing the parasite reservoir in humans), but unless 
administration at high coverage levels is repeated indefinitely, such 
effects are transient, since transmission quickly restarts once infection 
is reimported from untreated areas”, Similarly, seasonal malaria chem- 
oprevention can be highly effective at reducing disease incidence”’ but 
is only appropriate for moderate to high transmission settings with 
highly seasonal transmission. Furthermore, both mass drug adminis- 
tration and seasonal malaria chemoprevention pose the risk of accel- 
erating the development of wide-scale artemisinin drug resistance in 
the parasite population. 

For dengue, the situation is worse. As described above, there are 
currently no interventions in large-scale use in LMICs that are likely 
to be having any major impact on dengue transmission or disease 
burden*’. The intensive vector control interventions adopted largely 
successfully by Singapore (and high-income settings such as Florida 
and Queensland) are not easily translated to most LMICs and are likely 
to be unaffordable for those countries. 

In addition to further evaluation and scale-up of current interven- 
tions, new intervention technologies are therefore needed to make 
elimination a feasible policy goal for either dengue or malaria. Over 
the last two decades, major investment and effort has been committed 
to the development of both vaccines and novel vector controls, with 
varying degrees of success. 


Vaccines 

The CYD-TDV dengue and RTS,S malaria vaccines, the first to be 
licensed for either infection, are both the result of over twenty years 
of development effort. Past development efforts have been hindered 
by the antigenic/immunological complexity of the pathogens and the 
lack of an obvious commercial market that is sufficient to justify devel- 
opment costs. 

In the phase III trial, over two years of follow-up, RTS,S showed 28% 
(95% confidence interval, 22-33%) efficacy against clinical malaria 
in children who received three doses over a three-month period, and 
36% (95% confidence interval, 31-41%) efficacy in children who 
additionally received a month-20 booster dose”. Efficacy was a little 
lower in infants and against severe malaria. Data from phase II and III 
studies and associated mathematical modelling of these data demon- 
strated waning of efficacy over time?*”’, which was correlated with a 
decrease in anti-circumsporozoite antibody titres®. This modelling 
also showed that anti-circumsporozoite antibody titres were a corre- 
late of vaccine-induced protection®*®. The model of vaccine action 
proposed by these analyses also explained the observed negative corre- 
lation between vaccine efficacy and the transmission intensity seen at 
trial sites, and the faster decay of efficacy seen in higher transmission 
settings. 

For CYD-TDV, two large phase III studies (in Asia and Latin 
America) both showed approximately 60% efficacy against virolog- 
ically confirmed clinical dengue disease in the one year following 
completion of a three-dose vaccine schedule””*. Efficacy varied by 
serotype, increased with age and was higher for severe dengue. In the 
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immunological subset of trial participants for whom dengue serological 
status was measured before the first dose, vaccine efficacy was approx- 
imately 75% in individuals who were dengue seropositive, but much 
lower (and non-significant) in individuals who were dengue seroneg- 
ative. However, in the first year of long-term follow-up, a statistically 
significant excess risk of hospitalized dengue disease was observed in 
vaccine recipients in the youngest age group (2-5 years old). Since no 
evidence of excess risk was seen in children over 9 years of age in either 
trial, Sanofi-Pasteur proceeded with submitting the vaccine for regu- 
latory approval with an age indication of use only in children over 9. 
The most parsimonious and plausible hypothesis that explains these 
results is that the vaccine acts akin to a silent natural dengue infection”. 
Seronegative recipients of the vaccine are immunologically primed 
(akin to natural primary infection), so that their first breakthrough 
natural infection has the higher severity associated with natural 
secondary infection in unvaccinated individuals. Conversely, seroposi- 
tive recipients of the vaccine will have their antibody titres against all 
four serotypes boosted to the high levels seen after secondary infection 
in unvaccinated individuals—leading to the first breakthrough infec- 
tion having the low severity associated with natural tertiary infection. 

Mathematical modelling was key to extrapolating from the trial 
results to assess the likely public health impact of large-scale use of 
both vaccines, and these analyses had a key role in informing WHO 
recommendations. For RTS,S, modelling predicted that the vaccine 
could prevent 1 malaria death per 200 vaccine recipients in moderate 
to high endemicity settings, making it highly cost-effective compared 
with many other vaccines”®, albeit less cost-effective than LLINs”. 
WHO recognized the notable potential public health benefits offered 
by the vaccine, but because of the potential difficulty of delivering a 
four-dose vaccine schedule and in light of the meningitis safety sig- 
nal, recommended that large-scale pilot implementation programmes 
should be conducted”®, which are now due to start this year in Ghana, 
Kenya and Malawi. 

For CYD-TDV, modelling indicated that large-scale vaccination 
might reduce the incidence of symptomatic and hospitalized dengue 
disease by up to 25% in high transmission intensity settings. However, 
vaccination was predicted to potentially increase the incidence 
of hospitalized dengue in low transmission intensity (and thus 
low seroprevalence) settings?>?*. WHO recommendations 
reflected this risk, suggesting that seroprevalence surveys across the 
population should be undertaken to assess transmission intensity 
before vaccination roll-out, and that the vaccine should only be used 
in settings in which over 70% of vaccine recipients were likely to be 
seropositive’®. These recommendations have now changed to recom- 
mend vaccination only in individuals who test seropositive in light of 
recent data collected by Sanofi—Pasteur that has conclusively demon- 
strated that seronegative recipients of all ages experience a higher risk 
of hospitalized dengue disease throughout the long-term follow-up 
of the trials””. 

A number of next-generation dengue and malaria vaccines are cur- 
rently under development. Two other tetravalent live-attenuated den- 
gue vaccines are currently in phase III trials, with initial results due in 
the next 12 months. It is unclear whether either will also pose risks of 
use in seronegative recipients that are similar to CYD-TDYV, but simi- 
larities in the immunogenicity profiles of all three vaccines makes this 
a possibility’”®. A variety of next-generation pre-erythrocytic, blood- 
stage and transmission-blocking malaria vaccines are under clinical 
development”, but none have yet entered phase ITI studies. For both 
infections, it is therefore arguably unlikely that vaccines will become 
available in the next decade that can on their own offer the promise of 
disease elimination. However, that is not to say that vaccines will not 
have an impact in that time frame: both current vaccines and next gen- 
eration late-stage candidates may make a considerable contribution to 
reducing disease burden, combinations of vaccines targeting different 
parasite life stages may offer synergistic levels of protection against 
malaria, and some potential exists for the development of ‘universal 
dengue vaccines®. 
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Fig. 3 | Mechanism of action of Wolbachia and the Y-linked ‘X-shredder’ 
homing-endonuclease gene-based gene drive control measures. 

a, Wolbachia-infected mosquitoes are refractory to arboviruses and are 
able to invade wild-type populations owing to cytoplasmic incompatibility, 
which leads to crosses between wild-type females and Wolbachia-infected 
males that are non-viable. b, A Y-linked X-shredder homing-endonuclease 
gene (HEG) that induces male mosquito survival biases the sex ratio, since 
a high proportion (here shown as all) of progeny in their crosses with wild- 
type females are male and will inherit the homing endonuclease genes. 

c, Illustrative invasion dynamics (releases occurring on day 50). Wolbachia 
exhibits frequency dependent invasion dynamics, with a threshold 
frequency determined by fitness costs of infection (here assumed to be 
30%). Therefore, invasion can occur when the release size equals 50% 

of the resident wild-type population, but not at 20%. Mosquitoes that 
carry Y-linked X-shredder homing endonuclease genes can invade at any 
frequency (here assumed to be 5% of resident population size) and cause 
the adult mosquito population to crash. 


Novel vector control technologies 
New insecticides and delivery systems continue to be developed; 
and perhaps the most interesting and potentially transformative are 
those interventions that target obligate life-cycle stages such as sugar 
or blood feeding. Although not a new idea, attractive toxic sugar bait 
technology*!—which targets sugar-feeding and therefore potentially 
increases mortality in both male and female mosquitoes—has been 
advancing rapidly in recent years, with very promising results (at least 
in anopheline species) seen in recent small-scale trials®’. Similarly, 
ivermectin (and newer longer-lived mosquitocidal drug candidates) 
could cause substantial suppression of mosquito populations and thus 
malaria transmission if used as part of a mass treatment intervention™. 
However, the approaches to vector control that offer the potential of 
transformative impacts are those that may give long-term (or even per- 
manent) reductions in disease transmission after only a single imple- 
mentation period, either by reducing mosquito density or reducing 
vector competence. Two such technologies are under active develop- 
ment: Wolbachia and gene-drive approaches to genetic modification 
of mosquito species (Fig. 3). 

Wolbachia is a genus of bacteria that naturally infects many insects®, 
strains of which have been deliberately transfected into A. aegypti 
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mosquitoes. Wolbachia typically transmits vertically by achieving 
high densities in insect eggs. It confers a frequency-dependent fitness 
advantage relative to uninfected insects through a mechanism of cyto- 
plasmic incompatibility, which results in crosses between Wolbachia- 
infected male insects and wild-type females (which would otherwise 
result in uninfected progeny) that are unviable (Fig. 3a). As a biological 
vector control measure, this offers the advantage that following initial 
releases of Wolbachia-infected mosquitoes into a wild-type popula- 
tion, the frequency of Wolbachia infection will rise to very high levels 
as the released mosquitoes interbreed with wild-type insects (Fig. 3c). 
Initially, Wolbachia was envisaged as a means to reduce mosquito den- 
sity, by using a strain (wMelPop) that imposed life-shortening fitness 
costs on mosquito hosts. However, Wolbachia-infected A. aegypti were 
also found to have substantially lower vector competence for a broad 
range of arboviruses®, including dengue®. It is this phenotype that is 
being exploited—using the less pathogenic wMel strain—by the World 
Mosquito Program (https://www.worldmosquitoprogram.org/), the 
leading development project for Wolbachia technology in A. aegypti. 
Mathematical modelling of data from experimental dengue-infection 
studies in wMel-infected and wild-type A. aegypti suggests that success- 
ful large-scale release and establishment of wMel-infected mosquitoes 
could reduce dengue transmission intensity (Ro) by 75%®’—sufficient 
to achieve elimination of dengue transmission for decades in even 
high transmission settings, and permanent elimination in low to 
moderate transmission settings”. Following an extensive programme 
of small-scale releases, a cluster randomized trial with epidemiologi- 
cal endpoints is now underway in Yogyakarta, Indonesia, with larger 
non-randomized pilot release studies ongoing in Medellin, Colombia 
and Rio de Janeiro, Brazil (World Mosquito Program (https://www. 
worldmosquitoprogram.org/)). Although at a much earlier stage, 
Wolbachia may also have some potential as a malaria control measure®. 

Gene-drive systems for genetically modified mosquitoes*”” offer 
similar advantages to Wolbachia in potentially allowing finite releases 
of modified mosquitoes to invade wild-type populations and rise to 
high frequencies. A variety of gene drive systems have been explored 
experimentally and theoretically?!"’, but the two most developed 
technologies are homing endonuclease genes and CRISPR-Cas9. Both 
rely on homology-dependent repair to be copied from one chromo- 
some to its homologous chromosome during mitosis of germline cells 
or meiosis®””, In the vector control context, most attention to date has 
been given to engineering constructs that supress mosquito popula- 
tions. Biasing the sex ratio towards males is one approach to achieving 
this. So-called X-shredder constructs achieve this by expressing 
endonucleases that are only expressed during spermatogenesis and that 
cleave the X chromosome, rendering it non-functional”. Inserting such 
constructs into the Y chromosome would generate a highly invasive 
genetically modified mosquito that in theory could be introduced into 
a wild-type population once and would eventually drive that popu- 
lation to extinction®*** (Fig. 3b, c). In Aedes aegpti, targeting male- 
determining factors using germline Cas9 expression could give similar 
results®!*®, although research is at an earlier stage compared with 
anopheline systems. Development of constructs that reduce vector 
competence is another area of active research®, and a variety of poten- 
tial targets have been identified for both malaria®””* and arboviruses”™. 

Both Wolbachia and gene-drive technologies face several challenges. 
First, gaining public acceptance and regulatory approval requires a rig- 
orous risk assessment/management process and intensive stakeholder 
engagement. Such barriers are clearly higher for genetically modified 
organisms than purely biological control measures such as Wolbachia. 
Second, use of these interventions will impose intense selection pres- 
sures that are likely to drive the evolution of resistance in either the 
target pathogen (in the case of Wolbachia or vector competence gene 
drive constructs) or the vector (for population suppression genetic 
constructs, and perhaps for Wolbachia), and the long-term pheno- 
typic stability of Wolbachia in A. aegypti is yet to be determined. 
Although several modelling studies have been undertaken, knowl- 
edge gaps in vector ecology (for example, regarding the intensity of 
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density-dependent regulation of larval populations, the extent and 
nature of overwintering mechanisms) make prediction of the impact 
of these interventions uncertain, especially for gene-drive systems. 


Towards elimination 

Although current interventions have proven insufficient so far, the 
novel vector control technologies described above may make elimi- 
nation of dengue or malaria a feasible goal, even in the highest trans- 
mission settings. However, despite the greater progress in controlling 
malaria than dengue in recent decades, the scale of the challenge to 
eliminate malaria from the highest transmission hyperendemic settings 
is larger. The Ro of dengue probably rarely exceeds 6, which means that 
reducing transmission by approximately 85% should be sufficient for 
permanent elimination in nearly all settings. Plus, for the reasons out- 
lined above, even interventions that achieve a (sustained) 30% reduc- 
tion in Ro should cause marked reductions in dengue incidence for 
decades if applied at sufficient scale—buying time for more effective 
interventions to be developed. By contrast, the Ro for P. falciparum is 
in the hundreds in the highest transmission settings, requiring a >99% 
reduction in transmission intensity to achieve elimination. In addition, 
there are multiple Plasmodium species that are capable of causing dis- 
ease in humans, and multiple competent vector species even outside 
the A. gambiae complex. Nevertheless, even if malaria elimination (and 
eventual eradication) remains a more distant goal, the vector tools 
that are currently under development offer the promise of delivering 
order-of-magnitude reductions in transmission and disease. 

While elimination may be the long-term goal of control programmes, 
disease reduction must remain the priority in the short to medium 
term. In this context, a wider range of current and near-to-market 
interventions are relevant—both vector control measures and vaccines. 
One of the greatest challenges will be to make optimal use of limited 
resources (especially in low-income settings) to deliver the greatest 
public health impact. Rigorous epidemiological analysis and mathemat- 
ical modelling will be key to ensuring such optimal deployment—for 
the extrapolation of clinical trial data to predict population impact of 
each intervention in a wider range of settings and in combination with 
other control measures. Rigorous monitoring and evaluation is also 
essential to evaluate the real-world effectiveness of interventions. The 
epidemiology, vector ecology and transmission intensity of both arbo- 
viruses and malaria is highly geographically variable, requiring inter- 
vention policies to be tailored to the local environment. Modelling has 
been pivotal in facilitating the characterization of this heterogeneity*”*. 

There is unlikely to be a single ‘silver bullet’ intervention that is 
sufficient to control either class of infections, so policy formulation 
will require setting-specific selection of interventions from the overall 
portfolio of available tools that have good evidence supporting their 
effectiveness. Such optimization should account for the potential syn- 
ergistic benefits of combining interventions with different mechanisms 
of action (for example, vector control and vaccines). Intervention effec- 
tiveness is only one of the criteria relevant to making that selection— 
cost, ease of delivery and public acceptability (and thus achievable 
coverage) are equally important. Such optimization is now possible 
using integrated disease transmission and health economic models**”4, 
although reliable surveillance and intervention effectiveness data 
are essential to calibrate such models. Last, data analytics (spanning 
dynamical modelling, statistical analysis and machine learning) will 
become increasingly critical to cope with the ever-growing volumes of 
surveillance, genomic, remote sensing and other (for example, mobile 
phone!) data becoming available—to synthesize multiple data streams 
and derive actionable insights to inform public health policy-making. 
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Challenges and recent progress in drug 
discovery for tropical diseases 
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Infectious tropical diseases have a huge effect in terms of mortality and morbidity, and impose a heavy economic burden 
on affected countries. These diseases predominantly affect the world’s poorest people. Currently available drugs are 
inadequate for the majority of these diseases, and there is an urgent need for new treatments. This Review discusses 
some of the challenges involved in developing new drugs to treat these diseases and highlights recent progress. While 
there have been notable successes, there is still a long way to go. 


countries (LMICs). Symptoms include impaired cognitive and phys- 

ical development in children (Box 1), complications in pregnancy, 
fever, nausea, diarrhoea, dehydration, anaemia, rashes, lesions, deform- 
ities, blindness, organ failure, haemorrhage, neurological problems, 
seizures and coma, and, in many cases, death. The WHO (World Health 
Organization) estimated that the diseases listed in Fig. 1 and Table 1 were 
responsible for more than 4 million deaths in 2015’ and the loss of more 
than 250 million disability-adjusted life years (DALYs) owing to ill-health, 
premature disability or early death”. 

Infectious tropical diseases include those defined by the WHO as 
neglected tropical diseases* (NTDs). In addition, diseases such as 
malaria, tuberculosis, HIV/AIDS, multi-drug-resistant Gram-negative 
bacterial infections, diarrhoea (from a variety of pathogens) and hep- 
atitis disproportionately affect tropical countries. The most common 
tropical diseases, grouped by causative agent, include: (i) viruses: Ebola, 
HIV/AIDS, Lassa fever, Marburg virus disease, Rift Valley fever, yellow 
fever, Zika disease, and the NTDs chikungunya, dengue fever and rabies; 
(ii) bacteria: bubonic plague, shigellosis, tuberculosis, typhoid fever, 
typhus, disease caused by drug-resistant Gram-negative bacteria, and the 
NTDs buruli ulcer, leprosy, mycetoma, trachoma and yaws; (iii) protozoa: 
cryptosporidiosis, malaria, and the NTDs Chagas disease, human African 
trypanosomiasis (HAT) and leishmaniases (visceral, cutaneous, post- 
kala-azar dermal and mucocutaneous); (iv) fungi: cryptococcosis and 
the NTD mycetoma; (v) helminths or metazoan worms: the NTDs 
cysticercosis, dracunculiasis (Guinea worm), echinococcosis, food- 
bourne trematodiases (clonorchiasis, opisthorchiasis, fasciolosis), 
lymphatic filariasis, onchocerciasis, schistosomiasis and soil-transmitted 
helminthiases (ascariasis, hookworm, trichuriasis, strongyloidiasis). 
Co-infection with these diseases is commonplace, and can exacerbate 
symptoms and complicate treatments (Box 2). 

Infectious tropical diseases inflict a heavy economic burden, costing 
LMIC economies billions of dollars every year. For example, the annual 
global cost of dengue in 2013 was estimated at US$8.9 billion‘, anda 
study carried out in the Philippines indicated that schistosomiasis 
resulted in the loss of an average of 45.4 work days per infected person 
per year. 

For many of these diseases the current treatments are unsatisfactory, 
and for some there are few or no treatment options available. There is an 
urgent need for new, safe, low-cost and efficacious treatments. Tackling 
these diseases effectively requires an integrated multi-sectoral approach 


| nfectious tropical diseases mainly affect low- and middle-income 


that links chemotherapy with prevention initiatives such as improved 
water supply, sanitation and hygiene, vaccines and vector control. Various 
governmental and non-governmental organizations, charities, founda- 
tions, product-development partnerships, academic groups and pharma- 
ceutical companies are supporting drug discovery for tropical diseases. 
A breakdown of funding is summarized in the G-Finder® and IFPMA 
2017’ reports. In 2016 about two-thirds of the total funding for research 
and development for neglected diseases was spent on HIV/AIDS (34%), 
malaria (18%) and tuberculosis (18%)°. Overall research and development 
funding has markedly increased in the last decade but the gains have not 
been distributed evenly across diseases. Greater efforts, resources and 
funding are urgently needed to meet the considerable unmet medical 
needs. 

In this Review, we will consider the specific challenges associated with 
drug discovery for tropical diseases and examine some of the recent 
successes. 


Discovery pathways and challenges 
Figure 2 shows the pathway for a typical small-molecule drug-discovery 
programme for an infectious disease. 


The drug-discovery pathway 

‘Hit discovery’ is the first step on the drug-discovery pathway, and 
is where chemical starting points are identified. This usually involves 
screening collections of compounds against a molecular target, typi- 
cally an enzyme (target-based screening), or against whole organisms 
(cell-based or phenotypic screening). Alternative approaches in hit dis- 
covery include structure-based drug discovery, re-purposing of drugs 
from other disease areas, and in silico methods. Next, ‘hit expansiom 
is performed to verify if a hit is genuine and whether there is scope 
for further work with the chemotype. The compound is then refined 
through a cyclic process of ‘design—make-test’ in the ‘hits to lead’ 
phase, until it has significant activity, typically in an animal model of 
infection. Subsequently, the biological activity, pharmacokinetics and 
safety profile of the series are optimized and balanced during the ‘lead 
optimization’ phase. ‘Candidate selection marks an important mile- 
stone in the drug-discovery pathway, as it involves selecting a single 
compound for progression. In this process, regulatory toxicology and 
scale-up are carried out to enable initial human studies. In human 
‘clinical trials, the compound is first tested in healthy volunteers to 
measure pharmacokinetics and safety (phase I); this is followed by 
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Box | 
Infants and children 


Many of the infectious tropical diseases disproportionately affect 
infants and children. In the case of malaria, the majority of deaths 
occur in children under the age of five, as they are immune- 

naive. Diarrhoea is a major problem among children, causing 
hundreds of thousands of deaths per year. The Global Enteric 
Multicentre Study (GEMS), a case-control study conducted at 
seven sites in Africa and south Asia, is the most comprehensive 
study of childhood diarrhoea to date!!#!13, The most common 
pathogens found in this study were Shigella spp., rotavirus, 
adenovirus 40/41, heat-stable enterotoxin-producing Escherichia 
coli, Cryptosporidium spp. and Campylobacter spp. In some 

cases, mixed infections were an issue. Some tropical diseases, 
such as cryptosporidiosis, schistosomiasis and soil-transmitted 
helminths!14115, are associated with malnutrition, growth stunting 
and impaired cognitive development in children. In addition, the 
drug-development pathway for infants and children is complicated. 
Pharmacokinetics are much more difficult to predict in infants and 
children and additional safety testing is required, making clinical 
trials challenging. Furthermore, ethical regulations in clinical trials 
for children are more complex. 


studies to establish efficacy (phase IT) and then large-scale efficacy and 
safety studies (phase III). 


Target product profiles and compound progression criteria 
The drug-discovery pathway should be guided by ‘target product pro- 
files’ (TPPs). These describe the desired features required of the final 
drug product, such as duration of treatment, whether the treatment is 
oral or parenteral, cost of treatment, acceptable safety margins, among 
others®. TPPs have been defined for malaria’, leishmaniasis!°, HAT’, 
Chagas disease!’, cryptosporidiosis!* and dengue fever’, but are absent 
for many other diseases, hampering the drug-discovery process. TPPs 
are used to develop ‘compound progression criteria, which inform the 
required disease-specific profile of a compound at every step along 
the drug-discovery process. These include, for example, activities in 
particular assays, pharmacokinetic profiles, and selectivity parameters’. 
Medicines for Malaria Venture (MMV) has several examples of com- 
pound progression criteria (https://www.mmv.org/research-develop- 
ment/information-scientists). 


Challenges 

For many tropical infectious diseases, there is little or no precedent for 
developing small-molecule drugs. This is exacerbated by insufficient 
understanding of the pathogen biology. As a result, there are often no 
relevant cellular or predictive animal models of the human disease. 
There is also frequently little or no data from clinically active molecules 
to help researchers to define the profile that a new drug should have 
in terms of its activity in preclinical assays and animal models, and its 
pharmacokinetics. The key issues facing researchers seeking to discover 
new drugs to treat tropical diseases are summarized in Fig. 2 and are 
considered below. 


Biological challenges. There are very few well-validated molecular drug 
targets for tropical diseases, in part owing to a lack of understanding of 
the detailed biology of many of the pathogens. For example, the func- 
tions of many proteins are unknown, or have only been inferred from 
other organisms. Therefore a key challenge is the selection of a suitable 
molecular target to limit the risk of subsequent failure®. Criteria used 
to help to select appropriate targets include essentiality, druggability, 
assayability and the opportunity for selectivity over host orthologues’. 
Even if a target complies with all these criteria, inhibitors frequently fail 
to demonstrate cell-based activity owing to poor permeability, inability 


REVIEW 


to compete with high substrate concentrations in cells, or because the 
compounds cannot inhibit the target sufficiently to kill the pathogen. 
The deficiency of genetic tools for many disease-relevant organisms is 
a key reason why so little essentiality data are available. However, new 
technologies are emerging!*"'®, including CRISPR-Cas9!”"8, which 
offer the prospect of markedly increasing the number of validated tar- 
gets available in the near future. 

Large-scale cell-based (phenotypic) screening is currently a popular 
approach to identify new chemical start points, as it does not require 
prior knowledge of the molecular target'®”°. Developing assays that 
have sufficient throughput to allow screening of compound libraries 
while maintaining relevance to disease pathophysiology is challenging, 
but helps to reduce attrition when using in vivo models and, ultimately, 
in clinical trials”! As assays increase in relevance, they usually become 
more complex and require more resources, both in terms of specialist 
equipment as well as staff to develop and run the assays. Other chal- 
lenges may include the lack of robust in vitro culture systems (for exam- 
ple, for Cryptosporidium), the inherent safety challenges of handling 
disease-causing agents in bulk, the lack of standardization between 
different laboratories working on the same organisms, and the use of 
laboratory strains and cell lines rather than recent clinical isolates and 
primary cells*?”. 

Existing animal models of tropical disease infections are often poorly 
representative of the human disease. For example in onchocerciasis 
(river blindness), the causative worm (Onchocerca volvulus) is only 
infective to primates; consequently, related, but different, worm species 
are used in mice or cattle models. Plasmodium falciparum, the pathogen 
that causes the most severe form of human malaria, does not infect 
rodents normally; however, a SCID mouse model has been developed 
that can be infected by P. falciparum*’. Work is also ongoing to develop 
a mouse model of human malaria caused by Plasmodium vivax’. Most 
mouse models of tuberculosis do not replicate the human disease well, 
and do not form the granulomas that are typical of the human disease. 
Encouragingly, progress is being made on developing animal models 
that more closely replicate human pathology*>-”’. 

Recent human clinical trials of azoles, which inhibit the enzyme 
CYP51, for treatment of Chagas disease have not been successful. A 
mouse model has now been developed that can distinguish between 
benznidazole, which is known to have clinical efficacy, and an azole 
(posaconazole), which does not”®. Better differentiation at this preclin- 
ical stage should improve the success rates of clinical trials. 


Quiescence and dormancy. Dormant infections further complicate the 
aetiology of some diseases. In malaria, P. vivax and Plasmodium ovale 
can remain dormant (as hypnozoites) in hepatocytes of the human host 
for weeks to years, potentially causing relapse”’. Latent tuberculosis, in 
which clinical symptoms are absent but there is a risk of progression 
to clinical disease, is estimated to affect a third of the global popula- 
tion*®. Dormancy has recently been identified in Trypanosoma cruzi”. 
Quiescent and dormant pathogens have reduced metabolism compared 
to actively dividing pathogens and may therefore be less susceptible to 
drugs. Developing assays for these forms of pathogens is problematic. 
To effectively treat and eliminate these diseases, new drugs are needed 
that effectively tackle the reservoir of dormant and quiescent infections. 


Chemistry. It is essential that the compound used for treatment can be 
dosed appropriately, usually orally, and can reach the part of the body 
where the pathogen is located. The physicochemical properties of a 
molecule, such as molecular weight, solubility, lipophilicity, charge, and 
number of hydrogen bond donors and acceptors are fundamental to 
achieving this. 

In some cases, penetration into the pathogen is problematic. For 
example, Gram-negative bacteria are surrounded by an outer mem- 
brane; drug molecules can penetrate this through proteins that form 
channels through the membrane called porins, which tend to select 
for hydrophilic molecules. The drug then has to traverse the cell 
membrane, which is a lipid bilayer; this requires compounds with 
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Fig. 1 | Deaths and disability-adjusted life years (DALYS) due to tropical 
diseases. From WHO estimates of cause-specific mortality (2015)' and 
disease burden (2015)*. Diarrhoeal diseases, tuberculosis, HIV/AIDS, 


a degree of lipophilicity. Therefore, to penetrate into the cytoplasm, 
compounds require very defined chemical properties, and are likely 
to be more polar and charged than typical drugs*”*?. Having pene- 
trated to the cytoplasm, the drug then risks being pumped out of 
the bacteria through efflux transporters. Pathogens such as T. cruzi, 
Chlamydia trachomatis, Mycobacterium tuberculosis and viruses spend 
the majority of their life cycle inside human host cells, which can 
impede drug access. Some pathogens localize to extreme intracellu- 
lar environments such as acidic organelles (for example, Leishmania 
spp. or Salmonella spp.) or necrotic granulomas (for example, 
M. tuberculosis). In the latter case, it is a major challenge to design 
drugs that can penetrate through the non-vascularized lipid-rich 
caseum to access the pathogen*. Other infections are localized at 
sites that are protected by a blood-tissue barrier, such as the central 
nervous system (for example, HAT, Cryptococcus neoformans, Taenia 
solium, or encephalitis- or meningitis-causing viruses or bacteria) or 
the eyes (for example, C. trachomatis). To reach these sites, drugs need 
to pass through endothelial cells without being pumped back into the 
blood by P-glycoprotein*. 

The nature of these diseases means that the cost of treatment 
must be low. For malaria, the aim is to produce a treatment that costs 
less than US$1. This implies a short and cheap chemical synthesis 
and precludes complex formulations to counter issues such as 
poor solubility. Owing to the lack of temperature-controlled supply 
chains in many regions, compounds that are stable over long periods 
of time at high temperatures (>40°C) and high humidity are required. 
Finally, patient compliance is also challenging, and is often com- 
pounded by minimal medical support. Consequently, it is essential to 
minimize the number of doses and complexity of dosing regimens. For 
example, MMV is aiming for a single-dose treatment for bloodstream 
malaria’. 


Resistance. Drug resistance is a considerable problem for treatment of 
most infectious diseases, in particular for viral, bacterial and apicom- 
plexan infections****. Poor management of treatments can accelerate 
the occurrence of resistance. The most publicized current example is 
antibacterial drug resistance. In humans, antibiotics are sometimes 
taken when not required, or given at sub-therapeutic doses, potentially 
increasing the proportion of resistant bacteria within a population. 
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This problem is aggravated by the large amounts of antibiotics in 
the environment owing to indiscriminate use in agriculture and 


aquaculture. 


Table 1 | The pathogens of common tropical diseases 


Disease 


Pathogen 


Type 


Diarrhoeal diseases 


Tuberculosis 
HIV/AIDS 


Malaria 
Hepatitis 
Dengue 

Rabies 
Cysticercosis 
Schistosomiasis 
Leishmaniasis 
Echinococcosis 
Leprosy 

Yellow fever 
Chagas diseases 


Food-bourne trematode 


African trypanosomiasis 


Ascariasis 


Hookworm diseases 


Lymphatic filariasis 


Onchocerciasis 
Trachoma 


Trichuriasis 


Multiple pathogens 


Mycobacterium tuberculosis 


Human immunodeficiency 
virus 


Plasmodium spp. 
Hepatitis A, B, C, D, E 
Dengue virus 
Rhabdoviridae 
Taenia solium 
Schistosoma spp. 
Leishmania spp. 
Echinococcus spp. 
Mycobacterium leprae 
Yellow fever virus 
Trypanosoma cruzi 


Clonorchiasis, Opisthorhiasis, 
Fascioliasis, Paragonimisaisis 


Trypanosoma brucei 


Ascaris lumbricoides 


Ancylostoma duodenale, 
Necator americanus 


Wuchereria bancrofti, Brugia 
malayi, Brugia timori 


Onchocerca volvulus 
Chlamydia trachomatis 


Trichuris trichiura 


Bacteria, Viruses, 
Protozoa 


Mycobacterium 


Virus 


Protozoan 
Virus 
Virus 
Virus 
Tapeworm 
Helminth 
Protozoan 
Tapeworm 
Mycobacterium 
Virus 
Protozoan 


Helminth 


Protozoan 


Soil-transmitted 
helminth 


Soil-transmitted 
helminth 


Nematode 


Nematode 
Bacterium 


Soil-transmitted 
helminth 


The diseases referred to in Fig. 1 are described with their corresponding pathogens. 
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Fig. 2 | The drug-discovery process. Top, the stages of the drug-discovery 
process; middle, generic challenges; bottom, additional key challenges 
at particular stages of the drug-discovery process for neglected diseases. 


Combination therapy is being adopted for many diseases in an 
attempt to slow development of drug resistance. This has been particu- 
larly successful in the treatment of HIV/AIDS, in which the resistance 
rate for single therapy” is exceptionally high. Combination treat- 
ment has also proved very effective against tuberculosis and malaria. 
Worryingly, in the case of malaria, resistance to artemisinin combina- 
tion therapy, the principal form of treatment*®, is now developing in 
southeast Asia. There is therefore a need for drugs with novel modes 
of action to deliver new combination therapies for which there is no 
clinical resistance to any of the components. 


Progress 

Despite all these challenges, progress is being gradually made. Some 
examples are highlighted below, although this is not a comprehensive 
list. Such successes are due to coordinated and substantial investments 
in research, along with concerted efforts to understand the TPPs and 
drug-discovery pathways. 


Apicomplexans 

With 200 million cases a year, the human and economic cost of malaria 
is extremely high*”. Africa bears a disproportionately high share of the 
burden with more than 400,000 deaths a year. Multiple new types of 
antimalarial drugs are needed to overcome resistance, give single-dose 
treatments, prevent relapse of P vivax infections, block transmission 
and to act as chemopreventatives. The malaria drug-discovery portfolio 
has markedly improved over the past decade*!. The starting points for 
the majority of the compounds currently in development were iden- 
tified by phenotypic screening in collaborations with MMV. In many 
cases, target deconvolution has subsequently led to the identification 
of the molecular targets. There are a number of novel compounds in 
clinical trials supported by or in collaboration with MMV (Fig. 3). 
The following are derived from phenotypic screens and are in phase II 
trials: the spiroindolone KAE609, the imidazolopiperazine KAF156” 


Translation from enzyme assays to cellular assays to animal models to the clinic 


Need for 


oralleiues Cost and stability 


Drug resistance 


Population-specific 
toxicity or 
pharmacokinetics 


Difficult to reach compartments, 
complex pharmacokinetics or 
pharmacodynamics 


Unknown clinical 
development path 


Tractable chemistry and cost of goods Access to patients 


Biological and pharmacokinetic issues are shown in lilac, chemical issues 
are shown in light blue and clinical issues are shown in red. 


(a combination trial with lumefantrine), the synthetic trioxolane 
artefenomel (OZ439, in combination with ferroquine)* and the 2- 
aminopyridine MMV048. The mode of action has been determined 
for some of these (P. falciparum (Pf) ATP4 for KAE609“* and PfPI4K 
for MMV048*), whereas the modes of action of KAF156 and OZ439 
remain uncertain. KAF156 has been linked to the cyclic amine resist- 
ance locus (Pfcarl)** and OZ439 probably acts through formation of 
carbon radicals, whereas ferroquine acts to prevent haem detoxifi- 
cation. In phase I trials are: the dihydroisoquinoline SJ733, another 
PfATP4 inhibitor*’; and M5717, which is active against translation 
elongation factor 2*8. Two compounds derived from target-based 
projects are also in clinical trials: the dihydroorotate dehydrogenase 
inhibitor DSM265* is in phase II trials, and the dihydrofolate reduc- 
tase inhibitor P218 is in phase I trials. 

We envisage an increase in target-based drug discovery as mode-of- 
action studies are revealing the chemically validated targets of many 
of the recently identified phenotypic hits. There are systematic efforts 
trying to identify the mode of action of phenotypic hits°°. New types 
of phenotypic screens have been developed that target life-cycle stages 
in addition to, or other than, the asexual blood stages>!>*. Clinical 
development for malaria has accelerated thanks to the development 
of human challenge models in which healthy volunteers are infected 
with treatable malaria to obtain an early indication of drug efficacy?" 

Cryptosporidiosis is a major cause of diarrhoea in children in tropical 
countries (Box 1). Nitazoxanide, the current treatment for the disease, 
has variable efficacy in immunocompetent patients and is not effective 
in immunocompromized patients*°*. There is an urgent need for new 
drugs to treat children under 24 months of age, especially those who are 
malnourished, immunocompromized or suffering from chronic diarrhoea. 

There have been recent advances in the cryptosporidiosis discovery 
pipeline. A series of bumped kinase inhibitors of Cryptosporidium 
parvum calcium-dependent kinase 1°” has been developed, an exam- 
ple of which is BKI1534. Promisingly, recent advances in C. parvum 
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Box2 
Co-infections 


Co-infections are associated with high prevalence and the 
extensive geographical overlap of some of these diseases. A person 
who is already infected with one pathogen may, in some cases, 
have an increased risk of infection with another pathogen!!4. Some 
of the issues around co-infection are: 

e Co-infections frequently accelerate disease progression and 
outcomes as one disease enhances the effect of another. For 
example, malaria infection increases the viral load of HIV-infected 
patients, also making it easier for that person to pass on the HIV 
virus!!©, HIV-tuberculosis co-infection is particularly problematic; 
HIV weakens the immune system, allowing latent tuberculosis to 
become activated and progress more rapidly. In turn, tuberculosis 
also accelerates the progression of HIV infection!!’. 

e Co-infections can result in complications in pregnancy and 

impair growth and development in children. For example, co- 
infections of malaria and hookworm can lead to severe depletions in 
haemoglobin resulting in severe anaemia!!8. 

e Co-infections can complicate treatments due to drug—drug 
interactions which can affect efficacy and can give rise to 
cumulative drug toxicities producing adverse side effects!1°. This is 
a particular problem in HlV—-tuberculosis co-infection. Rifampicin, 
which is commonly used to treat tuberculosis, causes induction 

of cytochrome P450s. The latter metabolize some of the more 
commonly used anti-HIV drugs!@°. This means that careful selection 
of treatments is required. 

e Co-infections can affect how diseases are treated. For example, 
ivermectin treatment for onchocerciasis may induce severe adverse 
reactions in some people with severe Loa /oa infection!?!!22, 


high-content imaging infection assays in human intestinal epithelial 
cells have made phenotypic screening of medium- and large-size com- 
pound libraries possible. A screen of compounds active against malaria 
has led to the identification of a preclinical candidate, KDU731°°, 
an inhibitor of the Cryptosporidium phosphatidylinositol 4-kinase. 
Clofazimine, an approved drug for the treatment of leprosy, was iden- 
tified from a screen of bioactive compounds” and is entering clinical 
trials for cryptosporidiosis. 


Kinetoplastid diseases 

This area has recently been reviewed in detail®. Mortality rates for 
HAT (also known as sleeping sickness) have decreased substantially in 
recent years®!; however, there is still a considerable disease burden in 
central Africa. The advent of nifurtimox—eflornithine (NECT) com- 
bination therapy in 2009 marked an important step in the treatment of 
stage two T: brucei gambiense HAT. This has reduced the duration of 
treatment as well as decreasing adverse effects®*. Nevertheless, treat- 
ment is still problematic because the two subspecies that cause the 
disease (T: brucei gambiense and T. brucei rhodesiense) and the two 
stages (acute and central nervous system) require different drugs®'. The 
requirement for staging patients before treatment also limits uptake, as 
it involves a painful and technically challenging lumbar puncture®’. The 
ongoing development of two new drugs has the potential to transform 
the clinical management of HAT. The first of these drugs is fexinida- 
zole, a nitroaromatic compound developed in the 1970s and 1980s as 
a broad-spectrum antibiotic, which was more recently found to have a 
suitable profile for clinical development for HAT®*. Drugs for Neglected 
Diseases initiative (DNDi) completed a phase III clinical trial in 2017 
and reported non-inferiority over NECT for both stages of the disease 
with a ten-day oral treatment®. The second is an oxaborole compound, 
SCYX-7158 (acoziborole)*’, which is currently undergoing a pivotal 
phase II/III trial!®. This has the potential to provide a single-dose oral 
treatment for all forms of HAT. 
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Less progress has been made for leishmaniasis and Chagas disease. 
Current treatments, particularly liposomal amphotericin-B (which is 
given intravenously), together with other measures, such as vector con- 
trol and effective surveillance, have led to progress towards elimination 
or control of visceral leishmaniasis (VL)® in the Indian subcontinent. 
However, current drugs are less effective or unsuitable in other areas 
of the world. New oral drugs are urgently needed to treat VL in these 
regions and to provide improved treatment options in the Indian sub- 
continent. Furthermore, HIV-VL co-infection is also currently poorly 
treated. Several potential oral treatments identified through phenotypic 
screening are in preclinical development, including an oxaborole and 
a nitroimidazole®. The discovery of a pan-kinetoplastid proteasome 
inhibitor”” (GNF6702, Fig. 3) offers potential for development of new 
drugs for leishmaniasis, Chagas disease and HAT. The majority of pre- 
clinical work on leishmaniasis focuses on VL, the deadly form of the 
disease, rather than cutaneous leishmaniasis (CL). New oral therapies 
for CL are urgently needed as well. Research on Chagas disease suffered 
a setback recently as a consequence of the clinical failures of the CYP51 
inhibitors posaconazole and fosravuconazole’!72, Currently, few treat- 
ments are under development for this challenging disease. 


Viruses 

Viruses such as HIV, hepatitis C virus (HCV), dengue and rabies have 
a large global impact (Fig. 1), resulting in over 1 million deaths in 
2015'. Effective treatments for HIV and HCV have been developed’*”*. 
Other viruses, including flaviviruses, arenaviruses, coronaviruses and 
filoviruses still pose a considerable threat for tropical countries and 
others, and pose a particular risk of global pandemics, as exemplified 
by the recent outbreaks of Zika”®, chikungunya’®, dengue”’, Ebola’’ and 
Middle East respiratory syndrome (MERS)””. Since future epidemics 
are almost inevitable, and could be triggered by any member of these 
families, drug-discovery efforts rightly focus on broad-acting antivirals®°. 
These diseases have received relatively little attention in terms of 
small-molecule drug discovery. However, both target-based and cell- 
based hit discovery approaches have yielded new compounds of interest, 
but to date none of these have been developed to a similar degree as the 
HCV and HIV drugs®. The most advanced compounds are favipiravir, 
GS-5734 (remdesivir) and BCX4430 (galidesivir), which target the viral 
polymerase (Fig. 3). Favipiravir was tested in humans during the Ebola 
virus epidemic of 2014. Although conditions complicated the study, it 
was concluded that further investigation was warranted*!. GS-5734 
has shown activity in rhesus monkey model of Ebola®’, and was 
also used in humans during the 2014 Ebola outbreak. It is currently 
undergoing a phase I clinical trial in West Africa. Both favipiravir and 
GS-5734 are being considered for use in the 2018 Ebola virus outbreak 
in the Democratic Republic of the Congo®’. BCX4430 has demon- 
strated complete protection against Marburg virus in non-human pri- 
mates and against Ebola in rodent models when administered within 
48 h of infection. This compound also shows broad activity against 
other viruses, including arenaviruses and flaviviruses®**°, and is cur- 
rently in clinical development. Inhibitors that target the viral protease 
are not as far advanced (with the exception of HIV and HCV); however, 
they also show promise as broad-spectrum antivirals. The potential 
of targeting host pathways is also being explored®***, Although new 
small-molecule antivirals are being discovered, there is an urgent need 
to translate more such treatments into the clinic, alongside ongoing 
vaccine development. 


Bacterial diseases 

Tuberculosis continues to be a major problem across the world. As 
with malaria, combination treatments are used against tuberculosis to 
combat resistance, and can circumvent issues of tuberculosis residing 
in multiple environments and metabolic states. The need for combina- 
tion therapy complicates the progression of new compounds through 
clinical trials (https://www.tballiance.org/portfolio). Recently, two new 
drugs, bedaquiline and delamanid, have been conditionally approved 
for the treatment of specific levels of drug-resistant tuberculosis on 
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Fig. 3 | Key compounds discussed in the text. 


the basis of phase IIb trials, and are currently undergoing phase III 
studies*’. Bedaquiline targets ATP synthase® and is active against the 
latent forms of the bacterium®’. Delamanid is a nitro heterocycle; its 
precise mode of action has not been determined, but it is probably 
a prodrug that requires activation inside the bacterium. It has been 
implicated in inhibition of mycolic acid biosynthesis, which is critical 
for the structure of the cell wall of M. tuberculosis”®”'. 

A number of other compounds that are currently under development 
for treatment of tuberculosis target protein synthesis, the respiratory 
chain or the cell envelope (https://www.tballiance.org/portfolio and 
https://www.newtbdrugs.org/pipeline/clinical). Several different steps 
in protein synthesis are being studied. Linezolid has demonstrated 
activity against drug-resistant tuberculosis, and has a marked effect on 
patients with extensively drug-resistant tuberculosis”. It is a member 
of the oxazolidinone class of molecules, and targets the 50S ribosomal 
subunit. However adverse effects are associated with the extended treat- 
ment periods with linezolid that are required for multi-drug-resistant 
tuberculosis. This is linked to mitochrondrial toxicity’*. Other 
oxazolidinones, such as sutezolid and delpazolid, are undergoing clinical 
trials®”. Avoiding mitochondrial toxicity should allow development of 
oxazolidinone analogues with higher tolerability. GSK070, an oxaborole 


inhibitor of leucyl-tRNA synthetase, an enzyme involved in protein 
synthesis, is progressing through clinical trials?*. GSK070 forms a 
covalent adduct with the 3’-adenosyl acceptor nucleotide of the tRNA, 
its boron binding to the 2’- and 3/-hydroxyl groups. This complex is 
formed in the editing site, inactivating the enzyme. 

Q203 has just entered clinical trials and targets the respiratory pathway**, 
inhibiting cytochrome bc;. This is on the same pathway that is targeted by 
bedaquiline, although it acts at a different step. Hitting multiple points on 
this pathway appears to enhance killing of the bacterium®. 

B-lactams, which target cell wall biosynthesis, are also being re- 
investigated for treatment of tuberculosis. Carbapenems appear to have 
the most potential in this class of compounds, because of the proteins 
that they target and because they are much weaker substrates of the 
M. tuberculosis B-lactamases*’. Several enzymes involved in cell 
envelope biosynthesis are promiscuous targets, including DprE1 
and MmpL3. The former is involved in the biosynthesis of an arab- 
inogalactan polysaccharide that is critical for cell wall biosynthesis. 
Benzothiazones have been found to be covalent inhibitors of this 
enzyme; one of these compounds, BTZ043, is in preclinical develop- 
ment, and another, PTBZ169, is undergoing clinical evaluation (https:// 
www.newtbdrugs.org/pipeline/clinical). Two non-covalent inhibitors 
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of DprE1 are also undergoing clinical evaluation, both in phase I, an 
azaindole, TBA-7371, and OPC-167832. MmpL3 is required for the 
export of mycolic acid precursors, which are essential for cell wall 
biosynthesis. Compound SQ109, which targets this enzyme and probably 
others®’, is in clinical trials". Several preclinical drug-discovery 
programmes are targeting the enoyl reductase InhA, which is involved 
in the biosynthesis of mycolic acid. InhA is also the target of the prodrug 
isoniazid. 

The nitroimidazole pretomanid (PA-824), which is active against 
both replicating and non-replicating mycobacteria, is in clinical trials. 
Like delamanid, pretomanid is likely to be a prodrug. A number of 
other compounds™ are undergoing clinical evaluation (https://www. 
newtbdrugs.org/pipeline/clinical), and several fluoroquinolines are 
being investigated in combination therapies, including moxifloxacin 
and levofloxacin. 

Whereas antibacterial drug resistance is not technically a neglected 
or tropical disease, its impact is most clearly seen in LMICs, where 
there are very high levels of drug resistance. The WHO has recently 
published a priority list of pathogens”. The priority 1 pathogens are: 
Enterobacteriaceae, which are resistant to carbapenems and third gen- 
eration cephalosporins, and Acinetobacter baumannii and Pseudomonas 
aeruginosa, which are carbapenem-resistant. New drugs are urgently 
needed to tackle these pathogens, but there are few drugs in clinical 
development that address them”. In addition, several bacterial infec- 
tions, such as shigellosis and typhoid, almost exclusively affect LMICs. 
Drug resistance is a major issue, and worryingly, in some instances, 
the resistant bacteria have a higher fitness than wild-type bacteria?” 
Treatment is reliant on existing antibacterials. 


Helminths 

At least one billion people, and probably more, are estimated to 
have one or more helminth infections’. Nevertheless, the only new 
antihelminthic drug to emerge in the last 30 years is tribendimidine, 
which shows promising broad-spectrum activity. Recent clinical 
trials show that it is a suitable alternative for the benzimidazoles 
and praziquantel in various helminth infections!°”!. Beyond this, 
development of new treatments is focused on repurposing drugs. This 
has mostly involved anthelminthics that are approved for veterinary 
use. Oxantel pamoate, a licensed veterinary drug since 1974, has shown 
promising efficacy in human clinical trials for trichuriasis (whipworm), 
one of the more difficult-to-treat soil-transmitted helminths!”?, and in 
combination with albendazole and pyrantel pamoate, it has recently 
demonstrated high clinical efficacy against hookworm infections!™. 
Two other veterinary drugs, emodepside and moxidectin, show 
promising activity against several human helminths, and are currently 
undergoing clinical trials for onchocerciasis™°. Some antimalarial 
compounds show activity against schistosomes in vitro and in vivo. 
Mefloquine—praziquantel and mefloquine—artesunate—praziquantel 
combinations were tested in clinical trials against chronic Schistosoma 
haematobium infection. Unfortunately, adverse event rates were high 
and there was no increased efficacy compared to praziquantel’. In 
the case of filarial worms, an alternative strategy is targeting of the 
Wolbachia endosymbiotic bacteria; this results in slow killing of the 
worms, thereby avoiding serious adverse events. For example, four 
weeks of treatment with doxycycline cured people suffering from 
onchocerciasis. The major aim of Wolbachia-directed therapy is to 
find shorter-course antibiotics'©”. Several high-throughput phenotypic 
screens have been carried out to discover new chemical entities that act 
against helminth infections!°*!", but no new clinical candidates have 
been reported to date. 


Fungal infections 

Fungal infections are particularly serious in immunocompromized 
patients. In addition to the common fungal infections, several are found 
predominantly in tropical regions. One of the most important problems 
is meningitis due to C. neoformans, which is estimated to cause 180,000 
deaths per year among people infected with HIV, the majority of whom 
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are in Africa'!°. This is poorly treated at present, and there is a need for 
new more efficacious drugs. Mycetoma, a disfiguring infection of the 
skin and soft tissue, is a particularly neglected disease, and is endemic 
to tropical and subtropical areas. Mycetoma can be caused by bacte- 
ria (actinomycetoma) or by fungi (eumycetoma). Fungal mycetoma 
requires long treatment with azoles, but drug resistance, disease recur- 
rence and side effects are common. Patients often develop deformities 
that require amputations. DNDiare currently running a phase II/III 
trial for fosravuconazole to assess whether this azole is more effective 
and safer than the the currently used azoles itraconazole and ketocona- 
zole. A comprehensive review of the current antifungal pipeline has 
been published recently". 


Conclusion 

The current treatments used against tropical diseases are suboptimal, 
and in some cases, there are no drugs available. However, recent pro- 
gress in drug discovery in human African trypanosomiasis, tubercu- 
losis and malaria show that headway can be made with the concerted 
efforts of governments, charities, foundations, product-development 
partnerships, academic institutions and pharmaceutical companies. 
There is still a long way to go, even in these disease areas. The high 
attrition rate in clinical trials and issues of resistance mean that there 
is no room for complacency. Further, multiple new agents are required 
for each disease area to allow combination therapies. While there has 
been progress in some disease areas, there is much less support and 
activity in others. 
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Acting on non-communicable diseases 
in low- and middle-income tropical 


countries 


Majid Ezzati!?**, Jonathan Pearson-Stuttard!?, James E. Bennett? & Colin D. Mathers‘ 


The classical portrayal of poor health in tropical countries is one of infections and parasites, contrasting with wealthy 
Western countries, where unhealthy diet and behaviours cause non-communicable diseases (NCDs) such as heart disease 
and cancer. Using international mortality data, we show that most NCDs cause more deaths at every age in low- and 
middle-income tropical countries than in high-income Western countries. Causes of NCDs in low- and middle-income 
countries include poor nutrition and living environment, infections, insufficient taxation and regulation of tobacco and 
alcohol, and under-resourced and inaccessible healthcare. We identify a comprehensive set of actions across health, 
social, economic and environmental sectors that could confront NCDs in low- and middle-income tropical countries 


and reduce global health inequalities. 


asitic diseases that thrive and spread in hot and humid climates 

(http://www.who.int/topics/tropical_diseases/en/ and http://www. 
astmh.org/education-resources/tropical-medicine-q-a/major-tropical- 
diseases), rather than NCDs like cancers and heart disease, which are 
considered ailments of wealthy urbanized Western nations!”. NCDs refer 
to conditions other than injuries, maternal and perinatal conditions and 
nutritional deficiencies that do not involve an acute infection or parasite, 
and hence are not transmittable from person to person. 

In the 1970s, Omran proposed? a theory of epidemiological transition 
in which, as countries develop economically and ‘modernise; there is a 
shift from communicable (infectious) diseases to NCDs. This linear per- 
spective on health development has led the bulk of domestic and interna- 
tional health financing and policy in low- and middle-income countries 
to focus on infectious diseases and maternal and child health’. 

As worldwide data on deaths and disease have become more available, 
it has become increasingly apparent that NCDs also affect the popula- 
tions of low- and middle-income nations*~’. The growing evidence for 
the relevance of NCDs as a global health issue led to a United Nations 
high-level meeting on the prevention and control of NCDs, following 
which the World Health Organization (WHO) member states committed 
to reducing premature mortality from four NCDs (cancers, cardiovascular 
and chronic respiratory diseases, and diabetes in people in the age range 
30-69 years) by a quarter relative to their 2010 levels by 2025. NCDs 
have also been included in the United Nations Sustainable Development 
Goals (SDGs), with a target of reducing by one-third, relative to their 2015 
levels, premature mortality from NCDs and promoting mental health and 
well-being by 2030; the indicator used to measure progress towards this 
target is the same as that of the WHO target. 

Despite increasing awareness of the importance of NCDs as a global 
health issue, there is limited evaluation of how similarities and differ- 
ences in NCD composition between low- and middle-income tropical 
and high-income countries should influence policies and programmes 
that aim to reduce their health burden®**”. In this Review, we use interna- 
tional data to compare NCDs between low- and middle-income tropical 


p oor health in the tropics often brings to mind infectious and par- 


and high-income Western regions (Box 1), and use these comparisons 
to identify policies and interventions that could reduce their burden. 
We begin by describing the overall mortality burden and composition 
of NCDs in tropical regions in comparison to high-income countries. 
We then evaluate some of the potential causes of regional differences of 
NCDs, including genetics, infections, nutrition, healthcare and poverty. 
Finally, we identify a comprehensive set of actions across health, social, 
economic and environmental sectors aimed at reducing NCDs in low- and 
middle-income countries. 


NCD deaths in the tropics 

In 2016, life expectancy at birth in low- and middle-income tropical 
countries ranged from less than 53 years in Sierra Leone and Central 
African Republic to above 79 years in some countries in Latin America 
and the Caribbean, for example Costa Rica and Cuba (Supplementary 
Fig. 1). With the exception of the USA, in which life expectancy at birth 
of 78.6 years was similar to that of some better-off low- and middle- 
income nations, life expectancy was consistently high in high-income 
Western countries, surpassing 83 years in Switzerland, Spain, France, 
Canada and Australia. 

Of the 25.3 million deaths in the four tropical regions—A frica, India 
(south Asia), southeast Asia, and Latin America and the Caribbean— 
in 2016, 8.7 million (34%) were from infectious and parasitic 
diseases, maternal and perinatal conditions, and nutritional deficiencies, 
14.0 million (55%) from NCDs and the remainder from injuries. The share 
of deaths from NCDs in tropical countries had the same geographical 
pattern as life expectancy, ranging from below 30% in some African 
countries to over 80% in parts of southeast Asia and Latin America 
and the Caribbean, Mauritius and Seychelles (Fig. 1a). Tropical Africa 
was the only region in which NCDs accounted for fewer deaths than 
infectious and parasitic diseases, maternal and perinatal conditions, 
and nutritional deficiencies (33% versus 57%). NCDs accounted for 
nearly 90% of all deaths in high-income Western countries, higher than 
their share in all tropical regions, as predicted by the epidemiological 
transition theory. Crude death rates from NCDs as a whole, and from 
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Box | 
Low- and middle-income tropical 
countries 


Physicians and medical researchers have had a longstanding 
interest in health in the tropics because its climate and ecology can 
influence human health directly or by changing the viability and 
dynamics of disease vectors. In the tropics and elsewhere, health 

is also strongly influenced by the economic resources available to 
countries and individuals. In this Review we evaluate the status of 
NCDs by focusing on low- and middle-income tropical countries as 
a way to bridge two parallel bodies of work in global health: tropical 
diseases and NCDs. 


Countries were classified as ‘tropical’ if half or more of their land 
area was between the Tropic of Cancer and Tropic of Capricorn. 
This classification excludes some low-income countries, such as 
Bangladesh and Pakistan. Similarly, the focus on low- and middle- 
income excludes some tropical countries, such as Singapore. For 
reporting, we organised low- and middle-income tropical countries 
into the following regions. 

(1) Tropical Africa: Angola, Benin, Botswana, Burkina Faso, 
Burundi, Cabo Verde, Cameroon, Central African Republic, 
Chad, Comoros, Congo, Cote d'Ivoire, Djibouti, DR Congo, 
Equatorial Guinea, Eritrea, Ethiopia, Gabon, Gambia, Ghana, 
Guinea, Guinea Bissau, Kenya, Liberia, Madagascar, Malawi, 
Mali, Mauritania, Mauritius*, Mozambique, Namibia, 

Niger, Nigeria, Rwanda, Sao Tome and Principe*, Senegal, 
Seychelles*, Sierra Leone, Somalia, South Sudan, Sudan, 
Tanzania, Togo, Uganda, Zambia and Zimbabwe. 

(2) Tropical south Asia: India 

(3) Tropical southeast Asia: Brunei Darussalam, Cambodia, 
Indonesia, Lao PDR, Malaysia, Maldives, Myanmar, 
Philippines, Sri Lanka, Thailand, Timor-Leste and Vietnam. 

(4) Tropical Latin America and the Caribbean: Antigua and 
Barbuda, Barbados, Belize, Bolivia, Brazil, Colombia, Costa 
Rica, Cuba, Dominican Republic, Ecuador, El Salvador, 
Grenada, Guatemala, Guyana, Haiti, Honduras, Jamaica, 
Nicaragua, Panama, Paraguay*, Peru, Saint Lucia*, Saint 
Vincent and the Grenadines*, Suriname, Trinidad and 
Tobago and Venezuela 

Countries and territories marked with an asterisk had data on 
mortality but not on cancer incidence. To simplify reporting, we 
also did not analyse a few other low- and middle-income tropical 
countries that were not in these four regions. These include Fiji, 
Papua New Guinea, Micronesia, Kiribati, Tonga, Samoa, Solomon 
Islands, Vanuatu, Yemen and Oman. 

Throughout the paper, we compare health in the four low- 
and middle-income tropical regions with high-income Western 
countries, which include Australia, Austria, Belgium, Canada, 
Cyprus, Denmark, Finland, France, Germany, Greece, Iceland, 
Ireland, Israel, Italy, Luxembourg, Malta, Netherlands, New Zealand, 
Norway, Portugal, Spain, Sweden, Switzerland, UK and USA. 


most individual NCDs, were also higher in high-income Western 
nations than in low- and middle-income tropical regions. 

Crude death rates do not take into account that the populations of 
low- and middle-income countries are younger than in high-income 
countries (Supplementary Fig. 2). For example, 83% of tropical Africa’s 
population and 70% of India’s population is younger than 40 years, 
compared to 49% in high-income Western nations. Many infections 
disproportionately affect infants and children (such as diarrhoea, 
malaria and measles) or young and middle-aged adults (such as HIV 
and tuberculosis). In contrast, deaths from most NCDs increase with 
age (although there are exceptions such as congenital anomalies and 
haemoglobinopathies like sickle cell). This combination of distinct 
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age patterns of deaths from NCDs versus infections and different age 
structures is the main reason for higher crude NCD death rates in high- 
income countries. In other words, low- and middle-income tropical 
countries have lower NCD death rates than high-income nations at 
least partly because their population is younger. 

One way to compare the epidemiological (versus demographic) 
features of NCDs across countries is to fix the age structure to a reference 
(standard) population, a process known as age-standardization (this 
is done by taking a weighted average of age-specific death rates, with 
the weight for each age group being its share of standard population)’®. 
Once this is done, NCDs as a whole, and most individual NCDs, have 
higher death rates in tropical countries than they do in high-income 
Western countries (Figs. 1b and 2). The exceptions to this pattern are 
cancers, mental, neurological and substance-abuse disorders, and 
endocrine, blood and immune disorders, for which age-standardized 
death rates are slightly lower or about the same in the tropical regions 
(Fig. 2). In other words, low- and middle-income tropical countries 
have a disadvantage in NCDs compared to their high-income Western 
counterparts, as they do for infectious diseases. The largest absolute 
disadvantage of tropical regions is that of cardiovascular diseases, 
followed by chronic respiratory diseases, diabetes, kidney and other 
genitourinary diseases, and liver cirrhosis and other digestive diseases. In 
the following sections, we examine the regional patterns of cardiovascular 
diseases and cancers, which are the largest two groups of NCDs. 

That NCDs have a substantial mortality burden in low- and 
middle-income countries is consistent with the historical experience of 
high-income countries!!-!3, For example, in 1900, the ten leading causes 
of death in the USA included infectious diseases such as respiratory 
infections, influenza, tuberculosis and diarrhoea as well as heart and 
kidney diseases, stroke and cancers; by 1930, these latter diseases 
made up four of the five leading causes of death'+. As life expec- 
tancy increased in high-income countries, age-standardized death 
rates declined for some NCDs (stroke and possibly kidney disease) 
but increased for others (cancers)!*!. 


Regional inequalities in cardiovascular diseases 
Cardiovascular diseases accounted for 5.8 million deaths (23% of all 
deaths) in the four low- and middle-income tropical regions, with their 
share ranging from 13% in Africa to 32% in southeast Asia, compared 
to 2.2 million deaths (30% of all deaths) in high-income Western coun- 
tries. Age-standardized death rates from cardiovascular diseases as a 
whole, as well as from most individual cardiovascular diseases, were 
higher in low- and middle-income tropical regions than in the high- 
income Western region (Fig. 3). 

In relative terms, the largest regional mortality disparity was that 
of rheumatic heart disease, with death rates in the African and Asian 
tropical regions many times those in the high-income Western region. 
Rheumatic heart disease, which is caused by rheumatic fever from 
group A streptococcal disease, is a classic NCD of poor communities”. 
In 2016, it accounted for nearly 300,000 worldwide deaths, nearly half of 
which occurred in tropical Africa, India and southeast Asia. However, 
even in these regions, rheumatic heart disease accounted for no more 
than 4% of cardiovascular disease deaths in 2016. 

Rheumatic heart disease was present at the beginning of the twentieth 
century in high-income countries, declining sharply and virtually 
disappearing by the 1970s, except in disadvantaged indigenous com- 
munities'*"!©. Much of this decline occurred before the introduction of 
antibiotics, owing to improvements in housing, hygiene, nutrition and 
primary healthcare!!!*!°, More recent declines in middle-income tropical 
countries such as Costa Rica and Cuba have benefited from a combina- 
tion of environmental interventions and antibiotic therapy'*. Incidence 
remains high in sub-Saharan Africa and south Asia, where systematic 
prevention and treatment programmes are rare”!”, Other cardiovascu- 
lar diseases associated with infections that are more common in low- 
income than in high-income countries include endomyocardial fibrosis, 
pericarditis and cardiomyopathy®'®!°. Like rheumatic heart disease, they 
accounted for a small share of cardiovascular disease deaths in 2016. 
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Fig. 1 | NCD mortality in low- and middle-income tropical countries 
and high-income Western countries. a, b, Share of deaths from NCDs (a) 
and age-standardized death rates from NCDs (b) in low- and middle- 
income tropical and high-income Western countries in 2016. NCDs 
include: cancers; cardiovascular diseases; diabetes; endocrine, blood 

and immune disorders; non-infectious respiratory, digestive (including 


Although the near-disappearance of rheumatic heart disease from 
high-income nations has, appropriately, put a spotlight on its persistent 
burden in the poorest countries, the largest absolute mortality inequality 
between tropical and high-income regions is unequivocally that of 
stroke and ischaemic heart disease (Fig. 3) because they have a large 
mortality burden—more than three-quarters of cardiovascular disease 
deaths in every region. Historical data from high-income countries 
consistently show a decline in stroke death rates from as early as 1900, 
driven initially by a sharp decline in haemorrhagic stroke and sub- 
sequently by both haemorrhagic and ischaemic subtypes’®. Limited 
data from middle-income countries also show similar trends!*. High 
blood pressure is the single most important risk factor for stroke, 
especially for haemorrhagic stroke”, and is addressed later in this 
Review. Some stroke risk factors, such as sickle cell, are nonetheless 
predominantly tropical”!. 

How ischaemic heart disease changes during economic and 
health development is more complex than trends for rheumatic 
heart disease and stroke, which both tend to decline as overall 
health improves! }1-16.22, In high-income countries, ischaemic heart 
disease mortality increased in the first half of the twentieth century 
and declined subsequently, with similar trends seen today in middle- 
income countries with longitudinal data including Seychelles, South 
Africa and parts of Latin America'?!®** Some studies have found a 
low prevalence of markers of coronary atherosclerosis, a precursor of 
ischaemic heart disease, in remote indigenous populations of the tropics— 
such as the Masai and Samburu in east Africa in the 1960s***4, the 
Kalahari Bushmen in the 1970s” and the Tsimane in South America 
in the 2010s”°—even though diets, including the share of carbohydrates 
versus animal-source foods, vary substantially among these groups. It 
is, however, unclear how representative these groups are of the popu- 
lations of the larger tropical regions where they live, and studies in the 
same and other tropical regions have found high levels of coronary 
disease or its precursors”*”’. The variations in ischaemic heart disease 
rates and trends even within low- and middle-income countries may be 
because the disease has diverse causes, including chronic inflammation, 
fetal and early life malnutrition and stunted growth, smoking, 
heavy episodic (binge) alcohol drinking, and low consumption of 
fruits and vegetables, which are present to varying degrees in low- and 
middle-income countries. The large burden of ischaemic heart disease 
and stroke in low- and middle-income countries, and the similarities 
and differences in their causes compared to high-income countries, 
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liver), and genitourinary diseases; neurological conditions; mental and 
substance-use disorders; congenital anomalies; and sense-organ, skin, 
musculoskeletal and oral/dental conditions. The mortality and population 
data were taken from the WHO global health estimates®’. All results are 
for females and males combined. Death rates were age-standardized to the 
WHO standard population!” 


necessitate going beyond the simple paradigm of epidemiological tran- 
sition to identify interventions that effectively address their local causes 
and health systems that are able to help patients with these conditions. 


Diverse regional patterns of cancer mortality 

Cancers accounted for 2.4 million deaths in 2016 (9% of all deaths) in the 
four low- and middle-income tropical regions, with their share ranging 
from 6% in Africa to 18% in Latin America and the Caribbean, com- 
pared to 1.9 million deaths (26% of all deaths) in high-income Western 
countries. Cancers are a heterogeneous group of diseases, so the regional 
variations may differ by type of cancer®”*. After age-standardization, 
death rates from many cancers were lower in the tropical regions than 
in high-income Western countries, with the difference largest for lung 
cancer, followed by colorectal and pancreatic cancers (Fig. 4). However, 
cervical, head and neck, liver, stomach and oesophagus cancers had 
higher death rates in all or some tropical regions than in the high- 
income Western region. Prostate cancer death rates were higher in trop- 
ical Africa and Latin America and the Caribbean than in high-income 
countries, but the opposite was true in India and southeast Asia. 

The regional differences in cancer death rates are due to a combi- 
nation of differences in incidence and survival. Variations in cancer 
incidence are, in turn, due to different exposures to its major risk factors 
including smoking and alcohol use, chemicals and radiation in the envi- 
ronment, infections, diet, reproductive and sexual practices, and meta- 
bolic and hormonal factors”*", Access to and utilization of screening 
and treatment also matters when precancerous lesions or early-stage 
cancer can be effectively treated, such as for cervical cancers. The inci- 
dence of most cancers in low- and middle-income tropical regions 
was similar to, or lower than, that in high-income countries, with the 
difference largest for prostate and breast cancers, followed by lung and 
colorectal cancers (Supplementary Fig. 3). That prostate and breast 
cancers had lower incidence but higher death rates in low- and middle- 
income regions is because of worse cancer survival®”. Cervical, liver and 
stomach cancers were the only cancers with higher age-standardized 
incidence in all or most low- and middle-income regions. Kaposi's 
sarcoma, of which HIV/AIDS is the leading cause, was unfamiliar to 
many physicians in the early 1980s, and has emerged as a leading cancer 
in Africa as a result of the HIV/AIDS epidemic, and the single most 
common cancer in countries such as Zimbabwe and Mozambique. 

Almost all cervical cancers are caused by infection with human 
papillomavirus, which is mostly transmitted via sexual contact, and 
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Fig. 2 | Comparison between low- and middle-income tropical regions 


and the high-income Western region of age-standardized death rates 
from NCDs in 2016. The mortality and population data were taken from 


around 90% of non-cardia stomach cancers (that is, stomach cancer 
not including the upper stomach) are due to Helicobacter pylori infec- 
tion (an estimated 70% of all stomach cancers are non-cardia)***4, a 
bacterium transmitted via contaminated food and water. Given their 
main routes of transmission, both infections are strongly related to the 
hygiene environment and behaviour. Cervical and stomach cancer inci- 
dence and deaths have declined steadily in high-income countries and in 
emerging economies, owing to improvements in water and sanitation, 
housing, electrification and refrigeration, and access to soap and other 
basic hygiene necessities, as well as effective screening and treatment pro- 
grammes for cervical cancer***°. Poorer countries fare worse in terms 
of the social, environmental and health-system determinants of these 
two infection-related cancers. Further, although an effective vaccine 
for human papillomavirus now exists, most lower-income nations have 
not introduced human papillomavirus vaccination, partly owing to its 
cost?®4°. Asa result, in 2016, nearly two thirds (63%) of worldwide deaths 
from cervical cancer deaths occurred in the four tropical regions covered 
in this Review, compared with 7% in high-income Western countries. 
Liver cancer, which is also more common in low- and middle-income 
tropical countries than in high-income countries, has multiple causes, 
including hepatitis B and C viral infection, mycotoxins, especially afla- 
toxin from contamination of foods such as maize and groundnuts dur- 
ing storage, liver flukes, which are transmitted by eating infected raw 
freshwater fish, smoking, alcohol use, adiposity and diabetes?8*!-43, In 
high-income countries, hepatitis B virus infection has declined, while 
adiposity and diabetes have increased**, and alcohol use and smoking 
have declined in some countries and increased in others**. Owing to 
these complex trends, liver cancer declined in the middle decades of the 
twentieth century in high-income countries, followed by a continued 
decline in some countries and an increase in others, such as the USA 
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the WHO global health estimates**. Results are for females and males 
combined. Death rates were age-standardized to the WHO standard 
population’, 


and the UK**. Currently, the highest worldwide hepatitis B infection 
rates are those in tropical Africa, where hepatitis B virus vaccine cov- 
erage, especially for the birth dose, is lowest*”. As a result, despite the 
availability of an effective vaccine, hepatitis B infection, in combination 
with aflatoxin, remains a key determinant of the large burden of liver 
cancer in Africa”, especially in west Africa. 

Although infections remain an important cause of cancers in low- 
and middle-income tropical countries, other cancer risk factors are 
increasingly important. In particular, smoking accounts for about 20% 
of all cancer deaths in low- and middle-income countries, compared to 
30% in high-income countries”’. However, smoking is rapidly shifting 
from high-income to low- and middle-income countries owing to 
differences in taxes and other tobacco control measures, and fuelled by 
the tobacco industry’s tactics*”. Harmful alcohol use, which accounts 
for about 5% of all cancer deaths worldwide is also common in Latin 
America, and rising in Africa and Asia*®, The coexistence of cancers 
that are largely or partly caused by infections and those that are related 
to smoking and alcohol use means that reducing cancer burden in low- 
and middle-income countries must include social and environmental 
improvements such as clean water and sanitation, electrification and 
refrigeration to reduce the risk of infections, vaccination to protect 
against infections, and effective tobacco and alcohol control. However, 
the benefits of some of these actions will take decades to materialize. 
Therefore, health systems should be equipped to detect and treat 
precancerous signs or early-stage cancers”. 


Causes of NCDs in low- and middle-income countries 

The data presented in this Review demonstrate that not only are NCDs 
present in low- and middle-income tropical countries, but that also, 
after accounting for differences in age structure, the populations of 


(per 100,000 people) 


Fig. 3 | Age-standardized cardiovascular disease death rates in low- 
and middle-income tropical regions and the high-income Western 
region in 2016. A point above the dotted line has a higher death rate 
in the tropical region than in the high-income Western region, and a 
point below the dotted line has a lower death rate. Data are for females 
and males combined. Congenital heart anomalies are not classified as 
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Fig. 4 | Age-standardized site-specific cancer death rates in tropical 
regions and the high-income Western region in 2016. A point above 
the dotted line has a higher death rate in the tropical region than in the 
high-income Western region, and a point below the dotted line has a 
lower death rate. Mortality and population data were taken from the 


these countries experience a higher burden of NCDs than high-income 
Western countries. Beyond a general disadvantage, there are differences 
in the composition of NCDs across low- and middle-income tropical 
regions compared to the high-income Western region. Here we examine 
some of the potential determinants of these regional differences. 


Genetics 

Genetic variants that increase the risk of NCDs may have different 
regional frequencies. For example, the sickle haemoglobin gene (HDS), 
which is protective against malaria, is more frequent in tropical regions, 
especially in Africa, than in other parts of the world, with three-quarters 
of children with a haemoglobinopathy born in sub-Saharan Africa™. 
Within Africa, the frequency of the HbS gene is associated with the 
historical geographical distribution of malaria, supporting the hypothesis 
that a classical tropical disease has shaped the geographical distribution 
of an NCD via genetic selection?). 

It is possible that genetic susceptibility loci for other NCDs are also 
over- or under-represented in tropical regions. For example a variant 
on chromosome 17q21, which is more common in men of African 
descent than in non-African men, is associated with an increased risk 
of prostate cancer*>*4, How much this single-nucleotide polymorphism 
accounts for the higher rates of prostate cancer in tropical Africa, 
however, is unclear because its causal effect is not fully established 
and its frequency varies across Africa. More broadly, currently known 
genetic variants do not account for the large regional variations in NCD 
rates seen earlier or for the fact that disease rates change over periods of 
years and decades. Nor do genetic variants explain why migrants from 
the tropics to Western countries tend to converge to disease rates in the 
host environment within a few generations. Therefore, it is important 
to evaluate genetics alongside social and environmental determinants 
in studying worldwide disease patterns. 


Obesity and other cardiometabolic risks 

One of the factors underlying the claims that NCDs, especially ischaemic 
heart disease and stroke, are uncommon in lower-income tropical 
countries is the assumption that diet worsens as societies become more 
affluent and urbanized, leading to higher rates of obesity and diabetes, 
and higher blood pressure and lipids+”°>-8. However, affluence and 


WHO global health estimates® Death rates were age-standardized to the 
WHO standard population'®. Kaposi’s sarcoma is not shown as a separate 
category because its deaths were reported together with those from HIV/ 
AIDS in the WHO global health estimates. See Supplementary Fig. 3 for 
incidence. 


urban living can lead to improvements as well as worsening in diet. 
For example, consumption of fruits, which are beneficial for NCDs, 
is seasonal in poorer countries, while more affluent populations use 
food trade, transportation and storage to consume fruits year-round 
and hence have higher average consumption!*“°?, 

Body-mass index (BMI) and associated metabolic conditions 
also have varied patterns in relation to national income®. BMI is 
currently lower in tropical Africa and south and southeast Asia than 
in the high-income Western region, but is about the same in the latter 
region and Latin America and the Caribbean (Fig. 5)*°. In contrast to 
BMI, diabetes prevalence is about the same, or higher, in most low- and 
middle-income regions than in the high-income Western region’. The 
regional discordance between diabetes and elevated BMI, which is an 
important cause of diabetes, might be partly due to phenotypic varia- 
tions arising from inadequate fetal and childhood nutrition and growth 
in low- and middle-income countries as discussed below, leading 
to earlier onset of 3-cell dysfunction. Additionally, the better-resourced 
health systems in high-income countries can identify people at high 
risk of diabetes at an earlier stage, and use lifestyle and dietary modifi- 
cation and drugs to delay or prevent its onset. 

In an even more extreme form of disadvantage in low- and middle- 
income regions, the highest worldwide blood pressure levels are those 
in sub-Saharan Africa, with blood pressure also high in south Asia, 
whereas high-income nations have some of the lowest blood pressure 
levels in the world®'. Although historically, high as well as low blood 
pressure levels were observed in tropical countries”’, blood pressure 
levels seem to have increased in Africa and Asia over the past few 
decades, while declining sharply in high-income countries°>™. Of 
the cardiometabolic risk factors, only cholesterol reflects the classical 
epidemiological transition model, and is currently lower in tropical 
Africa, south Asia, and east and southeast Asia than in high-income 
Western countries (Fig. 5)®. 


Fetal and early-life environment and nutrition 

Nutrition and environment during the fetal, childhood and adolescence 
periods are an important determinant of NCDs. In the second half 
of the twentieth century, researchers realised that children born or 
growing during periods of famine, war and economic hardship had 
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Fig. 5 | Age-standardized cardiovascular disease death rate, mean 

BMI, mean systolic and diastolic blood pressure, mean serum total 
cholesterol and diabetes prevalence in low- and middle-income tropical 
and the high-income Western region. For each outcome, the colour 
shows the ratio for the level in each region to the average of the five 
regions. The inset text gives the actual level of each outcome. For example, 
mean BMI in high-income Western countries is 27.4 kg m~*, which is 1.12 
times the average of all five regions. Data on deaths are from the WHO 
global health estimates® and on risk factors from the NCD Risk Factor 
Collaboration (http://www.ncdrisc.org)“*>*)%, Death rates and risk 
factors were age-standardized to the WHO standard population’. 


a higher risk of some NCDs in adulthood®®. The thrifty phenotype 
hypothesis®’, which summarises these findings, postulates that early- 
life undernutrition increases the risk of NCDs, especially if followed 
by rapid or excessive weight gain®. In parallel, nutritionists and social 
scientists noted that, although there is a strong genetic influence on 
individual height, changes in height of an entire population are driven 
by social, environmental and nutritional status during childhood and 
adolescence®”!. Being taller is associated with enhanced longevity, 
lower risk of cardiovascular and respiratory diseases, and higher risk 
of some cancers”. These studies collectively demonstrate that fetal, 
childhood and adolescent environment and nutrition influence NCDs, 
with at least part of the effects mediated by linear growth. 

In the early twentieth century, the shortest heights in the world were 
those in tropical Asia and Andean Latin America’. Asians and Latin 
Americans grew taller over the course of the twentieth century, with 
some of the largest worldwide gains in height occurring in east Asia. In 
India and other south Asian nations, height in adulthood has plateaued 
at lower levels than those experienced elsewhere’. Adults in sub- 
Saharan Africa were taller than those in Asia and parts of Latin America 
in the early twentieth century, but fell behind because Africans born 
in the last three decades of the twentieth century are shorter than their 
predecessors””?. Poor early-life nutrition and stunted growth may be 
partly responsible for the fact that some NCDs are high in low-income 
countries, and decline as economies develop, the opposite of what is 
predicted by a simple epidemiological transition model. 


Infections, parasites and mycotoxins 
A distinct feature of the NCDs in tropical countries, especially in the lower- 
income regions of tropical Africa and south Asia, is the importance 
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Box 2 
Bidirectional links between 
infections and NCDs 


Not only are infectious diseases an important cause of ill health 

in low- and middle-income countries, they also affect some NCDs, 
including many cancers!9318©, That infections cause cancers 

was hypothesized centuries ago, and emerged as a subject 

of systematic research at the end of the nineteenth century, 

after Pasteur and Koch established the their role as a cause of 
disease in general?!®®, Difficulties in establishing a clear link 
between specific infectious agents and cancer, together with the 
identification of ionizing radiation, soot, tar and other chemicals 
as causes of cancer, led to a lack of interest in, and even dismissal 
of, experimental results on infection and cancers in animals. After 
electron microscopy facilitated the discovery of the endogenous 
retroviruses, which could enter the host cell and modify its 
genome, interest in infectious, especially viral, causes of cancers 
revived. Researchers took advantage of new laboratory techniques 
and epidemiological studies in populations with high cancer rates 
to identify the causal roles of human T-cell leukaemia retrovirus 
(HTLV-1) in leukaemia, hepatitis B and C viruses in liver cancer, 
human papillomavirus in cervical cancer, Epstein-Barr virus in 
Burkitt's lymphoma, and HIV in Kaposi’s sarcoma and non- 
Hodgkin’s and Hodgkin’s lymphoma?!. Further research as well as 
chance observations (for example, high rates of acute liver disease 
in poultry fed with groundnuts) led to discovering non-viral 
communicable causes of cancers including bacteria, parasitic 
flukes and fungi?!. The mechanisms for the carcinogenic effects 
of infections are varied and include oncogene activation, chronic 
inflammation, metaplasia of epithelium, release of carcinogens, 
general immunosuppression and loss of tumour suppressor 
ability!9. 

The relationship between infections and NCDs is bidirectional. 
For example, tobacco smoking, alcohol use and diabetes not only 
affect NCDs, but also increase the risk of infectious diseases®”-2!. 
The established or hypothesized mechanisms for these effects 
include reduced immune function, such as through adverse 
impacts on macrophage and leukocyte function and lower 
production of interleukins; increased viral production and virulence 
in hyperglycaemic environments or after alcohol use; and lower 
adherence to anti-retroviral treatment and riskier sexual behaviour 
for alcohol and HIV. Therefore, the rising prevalence of smoking, 
alcohol use and diabetes in low- and middle-income countries 
will set back efforts to combat classical tropical diseases such as 
tuberculosis and other respiratory infections??-®°. 

The bidirectional relationship also provides an opportunity for 
integration of disease prevention, detection and treatment?*°S, For 
example, screening tuberculosis patients can help to increase the 
currently low rate of diabetes diagnosis®8, and smoking and alcohol 
cessation and glycaemic control may improve the prognosis of 
tuberculosis patients. Direct evidence on the benefits of integration 
is currently lacking, especially from low- and middle-income 
countries with the highest infection burden, and will inevitably 
depend on extending health system access and utilization, health 
workforce training, availability of medicines for treatment, and 
patient follow-up to ensure treatment adherence. 


of diseases with infectious, parasitic and fungal origins (Box 2). As dis- 
cussed earlier, infection-related NCDs include rheumatic heart disease, 
liver cirrhosis, cervical, liver and stomach cancers, and Kaposi's sar- 
coma. Infections and parasitic diseases also contribute to other NCDs. 
For example, infectious respiratory diseases increase the risk of chronic 
obstructive pulmonary disease”4, schistosomiasis and other helminth 
infections increase the risk of genitourinary diseases”°, and inflammation 
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Box 3 


countries 


Reducing the burden of NCDs in low- and middle-income 
tropical countries requires a combination of prevention, early 
detection and treatment. We present here some interventions 

nd policies to address NCDs that have a large burden in low- 

nd middle-income countries (such as stroke, ischaemic heart 
isease and some cancers) as well as those predominant in such 
opulations (such as cervical cancer, rheumatic heart disease 

nd haemoglobinopathies). In selecting these strategies, we 

sed studies and reports that had quantitatively or qualitatively 
nalysed the expected impacts of health interventions, such 

s the Disease Control Priorities49291 the Lancet Series on 
CDs°, the Lancet Global Mental Health Series!°!, and the Lancet 
Series on Maternal and Child Nutrition!9213, as well as clinical 
guidelines such as those by the WHO and the National Institute 
for Health and Care Excellence. In relevant cases, such as for 
emergency response, we adapted the recommendations so that 
they can be implemented in settings with limited transportation 
and healthcare infrastructure. 


wma. ow 


9 oc OW 


z= 


Environment, nutrition and health behaviours 

- Provide access to safe and clean housing, water, sanitation, 

electricity and refrigeration to reduce infectious and parasitic 

causes of NCDs. 

¢ Implement public health programmes and social and 

agricultural policies that improve maternal, neonatal and child 

nutrition?04103, 

¢ Implement comprehensive tobacco and alcohol control, 

including taxation, warning labels, public smoking bans, and 

restriction of sales, marketing and advertising*910. 

« Eliminate industrial transfats in food supply; reduce salt in 

processed foods; implement food policies that increase access 

to and consumption of fruits and vegetables; and tax and restrict 
the sales and marketing of sugar-sweetened beverages and other 
refined carbohydrates!™. 

Healthcare 

¢ Implement universal, financially and physically accessible, 

high-quality primary and specialist care systems, with 

workforce, infrastructure and medicines equipped to manage 

NCDs, and effective referral pathways from primary to 

specialist care!04105, 

- Implement and increase the coverage of school and community- 
based human papillomavirus and hepatitis B virus vaccination 
(including birth dose for hepatitis B virus to prevent mother-to- 
child transmission)*?. 


Effective interventions for NCDs in low- and middle-income tropical 


- Increase access to and encourage cervical cancer screening for 
all women through opportunistic screening with visual inspection 
and outreach programmes for vulnerable groups, and treat 
precancerous lesions. 

« Treatment of streptococcal pharyngitis with effective antibiotics 

to prevent acute rheumatic fever and spread of group A 

streptococcus. 

e Measure blood pressure and use off-patent medicines to treat 
hypertension at primary care level, especially where laboratories 
for risk-based management of cardiovascular diseases are limited. 

« Equip primary care for early detection of diabetes, treatable 

cancers, and kidney disease, and referral into effective treatment 

pathways. 

« Implement newborn and paediatric NCD services for genetic 

and congenital disorders, type | diabetes, asthma and paediatric 

cancers in primary care, effective referral pathways to specialist 
care, treatment and management of such conditions and 

their complications, and patient and parent education and 

empowerment programmes. 

« Use mobile phones and other digital technologies to create a 

decentralized emergency response system, train and equip first 

responders for stabilization and risk stratification of patients 

with possible myocardial infarction and stroke, and implement 

community care pathways for rapid transfer of such patients to a 

specialized healthcare facility. 

« Implement pharmacological and psychosocial interventions for 

mental health conditions including depression and schizophrenia 

at the community and primary care level?°. 

Research and development 

- Develop an effective vaccine to prevent Helicobacter pylori 
infection. 

e Develop robust and low-cost diagnostic tools and devices for 

detection of early-stage cancers in the community setting, as has 

been done for HIV and other infectious diseases. 

« Characterize NCD and NCD risk factor phenotypes and traits— 

multi-morbidity with multiple communicable diseases and 

NCDs, and co-exposure to multiple environmental, infectious and 

physiological NCD risk factors—to identify optimal packages of 

interventions. 

- Investigate, in epidemiological and health service studies, the 

aetiological relationship between infectious diseases and NCDs, 

and the feasibility and benefits of integrating their diagnoses and 
treatment. 


as a result of chronic and repeated infections increases the risk of car- 
diovascular diseases'®”®. 

Although infections remain an important cause of NCDs in poor 
countries and communities, the burden of most infection-related 
NCDs is comparable to or smaller than that of other NCDs. For 
example, colorectal cancer, which is caused by high consumption 
of red and processed meat and low intake of dietary fibre now has 
a comparable burden to liver and stomach cancers in low- and 
middle-income tropical regions (Fig. 4). Age-standardized death rates 
from breast cancer are higher than those of cervical cancer in every 
tropical region except Africa, where their mortality burden is about 
the same. Similarly, prostate cancer has emerged as an important 
cause of death among men across low- and middle-income tropical 
regions, with a higher age-standardized death rate than liver or 
stomach cancer in some regions. Finally, NCDs with non-infectious 
origins are not only an important cause of mortality in low- and 


middle-income countries, but they may increase the risk of infec- 
tious diseases (Box 2). This reciprocal relationship should motivate a 
more integrated response to communicable and non-communicable 
diseases. 


Poverty 

More than two-thirds of people in the poorest countries in tropical 
Africa and about 12% in India live on less than I$1.90 per day (I$, 
international dollar that is adjusted for purchasing power and infla- 
tion; https://data.worldbank.org/indicator/SI.POV.DDAY?). The poor 
experience higher rates of NCDs partly because many causes of NCDs, 
including poor nutrition at all stages of life; inadequate housing, water, 
sanitation and electricity; smoking and elevated blood pressure, tend 
to be associated with poverty>”’”-””. People in poor countries and 
communities, and those with lower income, also have more limited access 
to high-quality healthcare, because health infrastructure, personnel 
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and technologies are more restricted or because they cannot afford the 
out-of-pocket cost of services. Limited primary care leads to low rates of 
diagnosis and treatment, or late diagnosis. For example, fewer women 
are screened and treated for cervical cancer and fewer people with 
hypertension and diabetes are diagnosed and treated in lower-income 
than in high-income countries*****. Late diagnosis, typically when 
complications arise, coupled with more limited specialist care, worsens 
the prognosis of NCDs in low- and middle-income countries compared 
to high-income nations*****. Given its central role in pathogenesis of 
NCDs, the poverty of individuals and communities, and not inherent 
features of the tropics, is the most important determinant of the higher 
NCD rates in the lower-income tropical countries and regions, com- 
pared to high-income nations as well as to the better-off region of Latin 
America and the Caribbean. 


Actions to reduce NCDs in low- and middle-income 
countries 

The data presented in this Review demonstrate that NCDs are an 
important cause of ill health and early death in low- and middle-income 
tropical countries, and after accounting for differences in age, most 
NCDs have a higher mortality burden than in high-income Western 
countries. Therefore, reducing the burden of NCDs is essential for 
improving health in low- and middle-income tropical countries and 
reducing global health inequalities. 

The causes of NCDs in low- and middle-income tropical countries 
are diverse and include poor nutrition and living environment, 
infections, insufficient regulation of tobacco and alcohol, and 
under-resourced and inaccessible healthcare. Although much of the 
epidemiological research has focused on the effect of these factors on 
individual NCDs, they often lead to multiple communicable and NCDs 
in the same people and communities, leading to multi-morbidity*™. 
Multiple co-occurring diseases and risk factors can have amplifying 
effects and lead to overall poor health and loss of productivity. At the 
same time, the shared causes and aetiologies of these conditions, and 
their overlap, provides an opportunity for systematic integration of 
public health and healthcare interventions for their prevention and 
treatment. Doing so requires better data on the patterns of multi- 
morbidity among communicable and non-communicable diseases and 
their risk factors in low- and middle-income countries because they 
may be different from those in high-income countries (Box 3). 

Underpinning these proximal determinants is a background of 
poverty, food insecurity (in terms of both quantity and quality), and 
insufficiency of infrastructure and services related to housing, water, 
sanitation, transportation and healthcare that afflict poor countries, 
communities and families. These challenges are the focus of the 
SDGs. The SDGs have broadened the development agenda of their 
predecessor, the Millennium Development Goals, whose health-related 
goals and targets focused on maternal and child health and infectious 
diseases. As stated earlier, SDGs have a target related to reducing NCD 
mortality, although this target excludes some important NCD causes 
of death in low- and middle-income tropical countries such as kidney, 
liver and other genitourinary and digestive diseases. Alongside this 
NCD mortality target, SDGs have targets for reducing tobacco and 
alcohol use (important causes of NCDs throughout the world), and 
for increasing universal health coverage (essential for enhancing 
timely diagnosis and treatment of NCDs and their risk factors in an 
equitable manner). The momentum generated by the SDGs provides an 
opportunity to turn political attention to NCDs. To succeed in reducing 
the large burden of NCDs in low- and middle-income countries and 
global health inequalities requires (1) identifying interventions and 
programmes that can be scaled up in resource-limited settings to 
effectively lower the burden of NCDs (Box 3) and (2) a substantial 
increase in domestic as well as international financing that supports 
low- and middle-income countries in sustained actions against NCDs. 
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The future of hyperdiverse tropical 


ecosystems 
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The tropics contain the overwhelming majority of Earth’s biodiversity: their terrestrial, freshwater and marine ecosystems 
hold more than three-quarters of all species, including almost all shallow-water corals and over 90% of terrestrial birds. 
However, tropical ecosystems are also subject to pervasive and interacting stressors, such as deforestation, overfishing 
and climate change, and they are set within a socio-economic context that includes growing pressure from an increasingly 
globalized world, larger and more affluent tropical populations, and weak governance and response capacities. Concerted 
local, national and international actions are urgently required to prevent a collapse of tropical biodiversity. 


he tropics hold a disproportionate amount of global biological 

diversity and are key to meeting the international community’s 

aims of socially just sustainable development and effective 
biodiversity conservation. Yet, tropical ecosystems are undergoing rapid 
environmental, socio-economic and demographic change’, often driven 
by forces originating in extra-tropical developed countries. The scale of 
these changes is unprecedented, and decisions implemented in the com- 
ing decades will define the future sustainability of the tropics. 

Guiding these decisions depends on understanding the diversity and 
vulnerability of the four major tropical ecosystems: the forests and mesic 
savannahs that cover most of the terrestrial tropics, the extensive fresh- 
water systems that receive half of the world’s rainfall and the shallow- 
water coral reefs distributed along 150,000 km of coastline (Fig. 1). Here 
we quantify and review the global importance of tropical biodiversity, 
evaluate the vulnerability of tropical ecosystems to proximate stressors 
and assess whether global and regional socio-economic changes will exac- 
erbate or ameliorate biodiversity loss. We then examine the effectiveness 
of conservation approaches and highlight the scientific advances required 
to support a sustainable tropical future. 


The global importance of tropical ecosystems 

Over evolutionary time, the tropics have acted as both a source of and 
a refuge for extra-tropical terrestrial and marine species**; but just 
how diverse and irreplaceable are the tropics today? The increase in 
species richness from polar to tropical regions, known as the latitudinal 
diversity gradient, is found across a wide range of taxa and biomes. As 
a result of this gradient, tropical latitudes—which cover just 40% of 
the Earth's surface—hold a startling proportion of the planet's species: 
our assessment reveals that almost all shallow-water zooxanthellae 
corals, 91% of terrestrial birds, and over 75% of amphibians, terrestrial 
mammals, freshwater fish, ants, flowering plants and marine fish have 
ranges that intersect tropical latitudes (Fig. 2a). For birds, the impor- 
tance of the tropics extends far beyond 23.5 degrees of latitude, given 
that almost half of all Nearctic species migrate to the Neotropics” and 
over 2 billion individual passerines and near-passerines cross the 


Sahara each autumn®. Moreover, a disproportionate number of species 
are endemic to the tropics. For example, there are more than six times 
as many endemic terrestrial bird species in the tropics as in temperate 
regions (Fig. 2a). Tropical zones are less important for marine 
mammals and birds, which peak in diversity at mid-latitudes”®. 
Nonetheless, more than 55% of these species use the tropics (Fig. 2a). 

Overall, 78% of species across the ten taxa that we assessed occurred 
within tropical latitudes, but incomplete taxonomic inventories mean 
that this is almost certainly an underestimate’. Between 15,000 and 
19,000 new species are described annually’, and the majority of 
recently described terrestrial vertebrates'! or predicted discoveries of 
invertebrates!” are from the tropics. Even terrestrial mammals are still 
being discovered at a rate of about 25 species a year, with the highest 
numbers in the Neo- and Afrotropics'’. Shortfalls in species descrip- 
tions for other taxa are often far greater. For example, only 70,000 
of an estimated 830,000 multicellular plants and animals have been 
named on coral reefs!4, and although approximately 500 spider species 
are described each year, this is a tiny fraction of the estimated 150,000 
undescribed tropical species’®. 

Tropical taxonomic shortfalls are further compounded by a suite 
of systematic sampling biases. These include undersampling when 
compared with temperate regions’®, the spatial aggregation of sam- 
pling effort around coastal areas!”, roads, rivers, urban settlements and 
high-profile research stations!®, biases in favour of dry-season sampling 
when many invertebrate taxa are least abundant”, and the paucity of 
samples from ecosystems that are harder to access, such as mesophotic 
and rariphotic reefs”. 

The biological diversity of the tropics is mirrored by many forms 
of societal diversity”!. For example, tropical countries contain 40% of 
the world’s population yet 85% of extant languages are spoken within 
them”. The tropics also provide incalculable benefits to humanity. They 
housed most of the key centres of plant domestication”’ and have been 
a vital laboratory for the development of science itself—the disciplines 
of ecology, biogeography and evolutionary biology are founded on 
evidence gleaned from tropical ecosystems. Tropical ecosystems also 
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Fig. 1 | The tropical biosphere. a, Tropical terrestrial and marine biomes. 
The tropical terrestrial biome (green) was defined as all tropical mesic 
ecoregions'®’. These ecoregions span 82% of the 50 million km? of land 
between 23.5° N and 23.5° S, but extend into the subtropics in some 
areas. The tropical marine biome was defined by the 1988-2018 mean 
minimum monthly 18°C sea-surface isotherm. This isotherm bounds the 
latitudinal extent of shallow-water coral-forming ecoregions (blue)!®4. 

b, The intertropical convergence zone (ITCZ). The ITCZ was defined as 


make vital contributions to globally important ecosystem services. 
Although they cover just 0.1% of the ocean surface, coral reefs provide 
fish resources for the 275 million people that live within 30 km of them™ 
and coastal protection for up to 197 million people’. Humid tropical 
forests cover less than 12% of the world’s ice-free land surface but pro- 
duce 33% of global net primary productivity and store 25% of the carbon 
in the terrestrial biosphere”, and tropical savannahs provide a further 
30% of global net primary productivity and 15% of carbon storage”’. 
Tropical ecosystems also help drive vital atmospheric teleconnections. 
For instance, 70% of the rainfall in the 3.2-million-km? Rio de la Plata 
catchment is estimated to come from evaporation in Amazonia”®. 
Vulnerability of tropical biodiversity 

For each of the five vertebrate groups that have been comprehensively 
assessed by the International Union for the Conservation of Nature 
(IUCN) and for which spatial occurrence data are available”’, species 
classified as Vulnerable, Endangered or Critically Endangered are more 
dependent on the tropics than are those classified as Least Concern 
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regions that received a mid-summer (January (turquoise colour gradient) 
or July (red colour gradient)) mean monthly total rainfall of >20 cm, 

for the period 1979-2017. Where both January and July had rainfall of 
>20 cm, we show the measurement from the month with the largest total. 
The ITCZ is a strong predictor of the distribution of the tropical terrestrial 
ecoregions shown in a. Data sources are presented in Supplementary 
Table 1. 


(Fig. 2b). In addition, 85% of species extinctions from these verte- 
brate groups have been of species that use the tropics”’. Consequently, 
although extinctions of other groups are less well understood, we can 
assume that most of an estimated 130,000 modern invertebrate extinc- 
tions*” will also have been of tropical species. Thus, not only are the 
tropics vastly more diverse than temperate regions, but this diversity is 
also at far greater risk from human actions*!. Moreover, given that the 
tropics have the highest proportion of species classified by the IUCN as 
Data Deficient and the lowest level of biodiversity-threat assessment'®, 
information shortfalls mean we are probably underestimating the vul- 
nerability of the tropical biome. We assessed this vulnerability in more 
depth by examining the effect of local and global stressors, the interac- 
tions between them and the resulting changes to tropical ecosystems. 


Local stressors 

The tropics are subject to some of the highest global rates of land- 
use change and degradation. Since 1990, while the spatial coverage 
of temperate forests has increased, tropical deforestation rates have 
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Fig. 2 | Tropical hyperdiversity. a, The proportion of species found 
within tropical latitudes for ten taxonomic groups. Bars are colour-coded 
to show the percentage of species ranges within the tropics. n gives the 
total number of species analysed in each group. Only birds, amphibians 
and mammals have been comprehensively sampled. Numbers at the end 
of the bars give the precise percentage of species whose ranges overlap 
tropical latitudes, as shown in the bars. b, The difference in the proportion 
of threatened (IUCN Critically Endangered, Endangered and Vulnerable 
categories) and non-threatened (IUCN Least Concern category) species 
found exclusively within tropical latitudes for the five comprehensively 
sampled groups. For example, 66% of threatened and 44% of non- 
threatened terrestrial mammals are confined to the tropics. The bars show 
the difference between these figures, which is 22% in the case of terrestrial 
mammals. Data are from Birdlife International!® (for birds), the IUCN? 
(for amphibians and mammals), the Ocean Biogeographic Information 
System (for marine fish), C. Veron (personal communication) (for 
shallow-water zooxanthellate corals), the Global Biodiversity Information 
Facility (for flowering plants) and a previous publication’® (for freshwater 
fish). Data sources are presented in Supplementary Table 1. 


exceeded five million hectares per year**. Additional effects stem 
from the expansion of large infrastructure projects, such as dams, and 
the growing demand for agricultural commodities, biofuels, timber, 
wood for fuel, and other natural resources”. All of these result in severe 
biotic responses. Even with mitigation, dams present a near-impassable 
barrier for river fish*4, and deforestation replaces a species-rich pool of 
forest specialists with a smaller pool of common open-area species*». 
The influence of land-use change also extends far into remaining natural 
areas through isolation and edge effects*®, additional anthropogenic 
disturbances*’ and altered climatic conditions*®. Edge effects suppress 
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the abundance of threatened vertebrates up to 200-400 m into tropi- 
cal forests*®; this has left almost no core forest refugia in the Brazilian 
Atlantic forest, of which over 80% is within 500 m of an edge*®. Even 
low levels of landscape modification have marked effects on range- 
restricted species*”, and time lags mean that some of the most deleteri- 
ous outcomes are observed only decades after landscape modification”. 

Pollution presents a diverse set of threats to tropical ecosystems. 
Inputs of sediment and nutrients from land-use change are well- 
established drivers of biodiversity loss across freshwater*! and coastal 
systems, including coral reefs*”. Pesticide use is increasing across the 
tropics, reflecting rapid intensification of farming practices* and high 
pressures from pests on tropical crops“. Tropical Asian rivers are a 
major source of the 1.2-2.4 million tonnes of plastic that enters the 
world’s oceans each year*’, with micro-plastics entering coral diets“ 
and larger debris increasing rates of coral disease*”. These examples of 
chronic pollution are exacerbated by extreme events such as the col- 
lapse of the Fundao Dam, which released about 50 million cubic metres 
of waste into a 600-km stretch of river in south-east Brazil and caused 
a 7,000-km/ toxic plume in the Atlantic Ocean®. 

Overexploitation is also pervasive across the tropics. Fishing 
has reduced fish biomass by over 75% across a third of coral reefs” 
and is shrinking the mean body size of exploited freshwater taxa”. 
Hunting has contributed to the loss of charismatic megafauna, extir- 
pating African elephants, rhinoceroses and large predators from most 
of their original ranges*!*. Tropical forests are affected by extensive 
over-harvesting of wildlife*', with estimates of the annual harvests of 
highly trafficked animals such as pangolins reaching into the millions 
of individuals’. Moreover, the growth in non-food uses of wildlife 
means that even small-bodied songbirds are at risk of global extinc- 
tion®*, Overexploitation also extends beyond fauna and is driving eco- 
nomically valuable tropical tree species to extinction®. 

Invasive species have been the second most important driver of 
vertebrate extinctions since ap 1500°°. Within terrestrial ecosystems, 
invasive species have exerted the strongest influence on islands and 
coastal mainlands*’, causing thousands of species extinctions and 
altering trophic structures**. On continents, invasive species currently 
have a greater effect on biodiversity in economically developed and 
extra-tropical regions, but tropical ecosystems are predicted to become 
increasingly vulnerable to invasion as the 21st century continues”. 
Despite a deficit of research in the tropics, two prominent examples 
highlight the scope and magnitude of species invasions into terrestrial 
tropical ecosystems: there was an 84% increase in detections of alien 
species between 2003 and 2010 in Singapore®!, and invasive African 
grasses could threaten up to 380,000 km? of Australia’s savannahs by 
promoting landscape flammability™. In aquatic ecosystems, invasive 
predatory fish—such as the Indo-Pacific lionfish in Caribbean coral 
reefs® or the Nile perch in African lakes“—have contributed to the 
loss of native species. Marine invasions are also facilitated by the mass 
transport of species in the ballast water of ships, which results in wide- 
spread biotic homogenization®. 


Global climate change 
Many of these local stressors are promoted by globalized drivers, but 
climate change is truly global. Increases in atmospheric CO2 concen- 
trations to levels higher than 400 p.p.m. have important implications 
for tropical terrestrial and aquatic ecosystems. Ocean acidification from 
dissolved CO) is changing ocean chemistry to the extent that declin- 
ing coral calcification has already been detected®. Conditions for reef 
accretion and growth may be mostly absent throughout the tropics by 
the year 2100 under ‘business-as-usual’ emission scenarios®”. Within 
savannahs, elevated CO, levels favour the growth of woody plants over 
grasses, contributing to woody encroachment and the potential for a 
switch in biome state*’. CO, fertilization may also have contributed 
to enhanced tree productivity and mortality rates observed in humid 
tropical forests”. 

Global warming is not proceeding at the same rate across the planet. 
Although the greatest absolute temperature increases are occurring 
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Fig. 3 | Vulnerability of tropical biota to local and climatic stressors. 
Species co-tolerance to a local and climate-associated stressor®*. The x 
axis shows responses to fishing for corals (a), reef fish (b) and freshwater 
fish (c); changes in landscape configuration for small-stemmed trees 
(diameter at breast height between 2 and 10 cm (d)) and forest birds 

(e); and fire suppression for savannah birds (f). The y axis represents 
longitudinal responses to climate-associated events: the 2015-2016 and 
1997-1998 coral bleaching events in the Seychelles for corals (a) and reef 
fish (b), respectively; the 1997-1998 El Nifio-induced drought for lower 
Amazonian freshwater fish (c); Amazonian fires during the 2015-2016 

El Nifio event for small-stemmed trees (d) and forest birds (e); and shrub 
encroachment between 1998-2008 in South Africa for savannah birds (f). 
Species relative density is represented from low (light green) to high 

(dark blue). The four quadrants represent the location of ‘survivor’ species 
tolerant to both stressors (green), species only susceptible to local stressors 
(yellow), species only vulnerable to climate-associated stressors (blue) 
and ‘double jeopardy’ species susceptible to both stressors (red). Numbers 
show the percentage of species that fall into the quadrant. n gives the 

total number of species (genera for corals). Data sources are presented in 
Supplementary Table 1. 


at higher latitudes, the tropics are already some of the hottest places 
on the planet and have the lowest inter-annual temperature variabil- 
ity’”. Consequently, they will be the first areas to experience sig- 
nificantly warmer climates than the present day” and will endure 
climatic conditions without present-day equivalents”’. In addition, 
some of the most important climate oscillations—including El Nifio 
and the Indian Ocean Dipole—take place within and exert their greatest 
influence on tropical regions. It is unclear whether these oscillations 
will change in a warming world, but extremes of their phases have 
the potential to exacerbate or ameliorate the overall warming trend. 
One outcome of increasing temperatures is the pole-ward shift of 
species ranges or movement to higher altitudes or deeper depths”>. 
For example, corals in southern Japan are extending northwards at 
about 14 kilometres per year”*, and temperate macroalgal communities 
are being replaced with corals and other tropical species along large 
stretches of Australian coastline’®. Latitudinal shifts in terrestrial and 
freshwater tropical species distributions are less certain, because of the 
many natural and anthropogenic barriers to species movement and 
the low dispersal capacity of many tropical species”®. Furthermore, the 
responses of terrestrial species are defined by changes in rainfall as well 
as temperature”’. 

If movement is not an option, tropical species must adapt or face 
extinction. Unfortunately, there is evidence that some species are 
either approaching their physiological limits or are unable to adapt 
to the current rate of environmental change”®. Increasing ocean 
temperature extremes are driving mass-bleaching events and mortality 
of reef-forming corals, with the time between bleaching events declin- 
ing by 76-80% since the early 1980s”’. Higher temperatures also affect 
tropical vertebrates—for example, by causing an extreme female bias in 
the sex ratio of green turtles in the warmer regions of the Great Barrier 
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Reef* and a reduction in the reproductive success of African wild 
dogs*’. Altered rainfall is also critical. Droughts are drying up biolog- 
ically diverse small streams*’, and even modest changes in dry-season 
length increase tropical tree mortality”” and modify tropical forest bird 
community structure®?, 


Interacting stressors and indirect effects 

Stressors affecting tropical species can interact in myriad ways**. 
We demonstrate this by compiling data from six case studies within 
a co-tolerance framework that enables the examination of species 
responses to two dominant stressors®°. Only a small subset of species 
or genera (8-32%) showed no or positive responses when both stressors 
were combined (Fig. 3), and up to 55% fell within the ‘double jeopardy’ 
quadrant that indicates a negative response to both stressors. Although 
our summary does not quantify the magnitude of effects, it clearly 
demonstrates that stressors can act together to reduce the abundance 
or occupancy of tropical species. Moreover, most tropical ecosystems 
are affected by more than two stressors at any given location and time™, 
and co-tolerance analyses of this type are likely to underestimate the 
reality of human impacts. 

Many changes to tropical ecosystems result from the indirect con- 
sequences of single or multiple stressors. On coral reefs, nutrient 
inputs from the land may increase susceptibility to coral bleaching, 
disease and outbreaks of pests*®°, and pole-ward reef expansion is sup- 
ported by feedbacks from range-shifts in tropical herbivorous fish”. 
Overexploitation can result in surprising changes in tropical ecosystem 
properties through trophic cascades. For instance, the extirpation of 
a single detritivore fish species in the Orinoco basin reduced down- 
stream organic-carbon transport, which in turn increased net primary 
productivity and respiration®”. On reefs, the overfishing of keystone 
predators has repercussions for benthic structure®’, and the removal 
of herbivores can limit coral recovery from mass-mortality events®. In 
mesic savannahs, changes to herbivore numbers alter ecosystem func- 
tions and structure through their interactions with wildfire regimes”. 
Invasive species are also frequently linked to other stressors: the intro- 
duction of the Nile perch had a major role in the decline of endemic 
fish species in Lake Victoria, but these declines were also catalysed by 
a combination of other drivers including soil erosion, eutrophication 
and overfishing™. 


Ecosystems in transition 

Interactions between multiple anthropogenic stressors are caus- 
ing pervasive changes in the tropics, such that alternative states are 
emerging across all major tropical ecosystems (Fig. 4). Perhaps counter- 
intuitively, trees are encroaching on savannahs and grasses are invading 
disturbed tropical forests—however, in both cases the changes are from 
species-rich to species-poor systems™”!, 

These marked ecosystem transitions are accompanied by wide- 
spread modification of species composition. For example, the relative 
abundance of coral species has been altered on reefs that maintain 
coral dominance”; the extirpation of native fish has followed species 
introductions in lakes®; liana biomass has increased in otherwise 
undisturbed Neotropical forests”; and patterns of plant regeneration 
in humid forests have been altered by the overharvesting of seed- 
dispersing vertebrates*!"*. Altered species composition is a cause for 
concern because it could signal the onset of more severe modification, 
especially if dominant species are vulnerable to other stressors or if there 
are cascading implications for ecosystem functioning. The collapse 
of Jamaican coral reefs provides one of the starkest examples. First, 
chronic overfishing depleted herbivorous fish populations, which 
left the system over-reliant on sea urchins for grazing algae. Then, 
in 1980, Hurricane Allen affected the system, creating a substantial 
amount of dead substrate. Although corals began to recover after the 
hurricane, the subsequent mass mortality of sea urchins owing to 
disease—combined with the fact that herbivorous fish were already at 
low levels of abundance—led to a phase shift from coral to macroalgal 
dominance®>”, 
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Fig. 4 | Tropical ecosystems in transition. a, Recurrent wildfires in 


historically fire-free humid tropical forests'®” can lead to the dominance 


of grassy vegetation that impedes succession towards closed-canopy 
forests?! 16°, These wildfires result from the combination of local actions 
(for example, agricultural practices and logging) and climate change that 
has increased the prevalence of weather that promotes wildfires'®’. 

b, Chronic local pressures and acute climatic stressors can lead to coral 
cover being replaced by macroalgae, sponges or sediment-laden turf 
algae®**>. During the 1998 global coral-bleaching event, >90% of live coral 
died in the inner Seychelles and nearly half of the reefs transitioned to 
fleshy macroalgal regimes®’. c, Woody encroachment is occurring in many 
savannahs™, causing biodiversity loss and altered system functioning™. 
Causes are mixed: regime shifts to forest-associated ecosystems have been 
attributed to fire suppression policies (for example, Brazilian cerrado (left) 
to forest (right)!”°), or changes in herbivory and increasing atmospheric 
CO,®. d, The boom in hydropower dam construction is affecting large 
tropical river basins'**. The transformation from lotic to lentic conditions 
reduces access to riparian and floodplain habitats that are nursery areas 
and feeding grounds for many of the species occupying higher trophic 
levels, leading to major shifts in species composition and ecosystem 
function®”. Images from J.B. (a), N.A.J.G. (b), G. Durigan (c) and C.G.L. 
(d), used with permission. 


Socio-economic context and response capacity 

The interacting proximate stressors causing tropical environmental 
change are underpinned by broader changes in socio-economic and 
political factors. We examined the trajectories of four types of underly- 
ing distal drivers, including demography and the economy (Fig. 5a, b), 
socio-political factors (Fig. 5c, d), markets (Fig. 5e, f) and technology” 
(Fig. 5g, h) to explore how tropical countries are changing relative to 
the rest of the world and to evaluate the relative influence of local and 
global drivers. We also examined how the capacity of tropical countries 
to reduce or cope with proximate stressors compares to non-tropical 
countries based on underlying governance systems (Fig. 5i, j) and 
research capacity (Fig. 5k, 1). 
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The immense biodiversity of the tropics exists in the context of rapid 
demographic and economic growth (Fig. 5a, b). The human population 
is growing at a faster rate in the tropics than elsewhere (Fig. 5a), and 
by 2050 half of the world’s population will live in the tropics”. These 
demographic changes are accompanied by a steady growth in gross 
domestic product (GDP) that is linked in part to the rapid expansion 
of agricultural and extractive industries. However, in the tropics, per 
capita GDP—which is an important measure of human well-being— 
remains far lower than the non-tropical average (Fig. 5b) and the rates 
of change suggest that there has been little closing of the global inequal- 
ity gap. Although the relationship between development and natural 
resource conservation does not have to be negative”**’, measures that 
reflect higher social performance are almost always associated with 
higher resource use”. A larger and more affluent tropical population 
will increase demands for timber, water, food, energy and land, all of 
which are strongly linked with environmental degradation. 

These internal changes will be exacerbated by economic growth in 
non-tropical countries and the continued displacement of environ- 
mental effects to less-developed areas’. Indeed, despite high levels 
of tropical cultural diversity”!’, external socio-political influences 
(Fig. 5c, d) suggest that tropical countries have become increasingly 
susceptible to globalization. For example, the proportion of imported 
food crops (Fig. 5c) and foreign land acquisitions are far higher in 
the tropics than elsewhere (Fig. 5d) and are associated with extensive 
road building!” and agricultural investment”. These trends towards 
increasing tropical globalization are reinforced by changes in market 
integration (Fig. 5e, f) and technological development (Fig. 5g, h). For 
example, agricultural exports (Fig. 5f) are steadily increasing, albeit 
from a far lower baseline than the rest of the world. Moreover, given 
comparatively low levels of adoption of technological developments, 
such as industrial fishing techniques (Fig. 5g) or fertilizers (Fig. 5h), 
there is an enormous risk that the rate of natural resource extraction in 
many tropical countries will increase further, to supply both domestic 
and export markets'™!. Taken together, these examples highlight the 
crucial role that external markets will have in determining the fate of 
tropical ecosystems. 

Effective environmental governance (Fig. 5i, j) is a necessary con- 
dition for improved sustainability outcomes!™, particularly when 
domestic (Fig. 5a, b) and global (Fig. 5c—f) distal drivers are expected to 
exert increasing and unsustainable pressure on tropical ecosystems”, 
However, national-level assessments of governance effectiveness place 
the tropics far below extra-tropical countries, with no sign of improve- 
ment (Fig. 5i). External support for environmental governance may 
help where local governance is weak. Yet, despite the biological impor- 
tance of the tropics, levels of environmental aid from the Organisation 
for Economic Cooperation and Development (OECD) are only 
marginally greater in the tropics than elsewhere (Fig. 5j), and these 
investments are dwarfed by the value of domestic resource extrac- 
tion (for example, agricultural exports; Fig. 5f), the value of which 
is two orders of magnitude greater than overseas environmental aid. 
Furthermore, OECD environmental aid has been declining in recent 
years and seems unlikely to increase in the short term). 

Low governance capacity in the tropics is further exacerbated 
by insufficient research and development investment (Fig. 5k) 
and low levels of scientific output (Fig. 51). Research investment is 
critical for driving innovation and the development of evidence-based 
solutions to environmental degradation!””. Despite some notable 
centres of excellence, the vast majority of biodiversity-related data 
and research is concentrated in wealthy, non-tropical countries!’, 
and manuscripts submitted by authors from low-income countries 
are less than half as likely to be published as those from high-income 
countries”. These trends highlight an alarming disconnect between 
the global scientific process and the people that are most capable of 
engaging with decision makers in tropical countries, who have the 
best understanding of local context and, arguably, have the strong- 
est incentive to achieve positive outcomes for tropical conservation 
through their research. 
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Fig. 5 | Socio-economic drivers of biodiversity loss and societal 
response capacities. Green lines represent countries with >50% of their 
area within tropical latitudes; purple lines represent all other countries; 
grey-shaded areas represent the proportion of the global total within 
tropical countries. a, Population (1960-2016). b, GDP per capita (2011 US 
dollars, based on purchasing power parity; 2000-2016). c, Foreign food 
crops (1961-2009). d, Cumulative overseas land ownership (2001-2017). 
e, Domestic and international airline passengers (1970-2016). 

f, Agricultural and forestry commodities export value (2001-2016). 

g, Bottom and pelagic trawler catch tonnages (1960-2014). h, Total 
fertilizer (nitrogen, potash and phosphate) consumption relative to crop 
area (2002-2013). i, Government effectiveness index (2000-2016). WGI, 
World Governance Indicators. j, Environmental aid (2000-2016). k, Public 
and private sector research and development expenditure as a percentage 
of GDP (2000-2015). 1, Scientific and technical journal articles per million 
people in the fields of physics, biology, chemistry, mathematics, clinical 
medicine, biomedical research, engineering and technology, and Earth and 
space sciences (2003-2016). Data sources are presented in Supplementary 
Table 1. 


Diverse solutions for diverse systems 

Tropical ecosystems—and therefore at least 78% of global biodiver- 
sity (Fig. 2a)—are at a critical juncture. Multiple interacting local 
and global stressors (Fig. 3) that are driving species extinctions and 
potentially irreversible ecosystem transitions””! (Fig. 4) are set 
within a changing socio-economic context (Fig. 5). This chang- 
ing context is characterized by growing and more affluent popula- 
tions, an increasingly globalized world, and weak governance and 
research capacity— all of which threaten to increase environmental 
degradation, conflict and inequality’’. Countering these threats 
requires major improvements in local and global governance capacity 
and a step-change in how environmental objectives are integrated 
into broader development goals!!°. We review the opportunities and 
limitations presented by three well-established and non-mutually 
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exclusive approaches to conservation, before highlighting priorities 
for research. 


Conservation approaches 

A fundamental element of tropical conservation relies on protected 
areas to limit demographic pressures and the effect of local stressors. 
These are supported by a wealth of scientific evidence outlining the 
pervasive effect of local stressors across tropical ecosystems*”” (Fig. 3) 
combined with an eco-centric philosophy that emphasizes the intrinsic 
rights of nature'!'. Despite a substantial expansion of protected area 
coverage in the marine and forested tropics!!?, the current network 
remains poorly designed, has very limited coverage of tropical fresh- 
waters and grasslands, and is inadequately resourced''?. Moreover, a 
strategy focused solely on protected areas may not foster environmental 
conservation outside of reserves!!* and fails to engage with the distal 
drivers of biodiversity loss (Fig. 5) that can undermine the effectiveness 
of protected areas themselves!!°. 

A second set of approaches for tropical conservation is based on the 
notion that people need to perceive the benefits of nature to justify 
conservation. These approaches emphasize the need to pursue con- 
servation objectives in human-dominated landscapes, the provision of 
ecosystem services and the involvement of private-sector actors. In the 
tropics, they are epitomized by the growth in market-based conserva- 
tion payment mechanisms, such as REDD+!"6, investments in the ‘blue 
economy’!”” and a step-change in the number of companies making 
sustainability commitments’!*. These approaches have strengthened 
the conservation toolkit, especially where strict regulatory approaches 
have failed. Encouraging examples range from the positive effects of 
commodity certification (for example, palm oil!!*) to payment for 
ecosystem service schemes (for example, watershed protection'”°). 
However, such approaches also attract considerable criticism, with 
implementation often lagging behind commitments''’, persistent con- 
cerns around the social legitimacy of compensation schemes’”! and 
the misalignment of market-based mechanisms with local needs and 
perceptions of environmental values”. 

A third and more diverse set of approaches is based on the 
recognition of the interdependencies between people and nature, 
the coevolution of ecological and socio-economic systems at local, 
regional and global scales'”’, and perspectives about the co-existence of 
people and nature. This set of ‘systems-based’ approaches includes: 
(1) an appreciation of the importance of bottom-up, community-based 
conservation approaches in human-dominated land- and seascapes (for 
example, small-scale fisheries!*4 and community-managed forests!?>); 
(2) recognition of the role of indigenous people as environmental stew- 
ards and shifts towards an appreciation of more collective relationships 
with nature (for example, the Ecuadorian constitution!”°); (3) land- 
scape- and ecosystem-wide approaches that attempt to bridge the role 
of actors working at different scales and in different sectors (for exam- 
ple, jurisdictional approaches to curb deforestation'””); and (4) a more 
explicit accounting of multi-scale feedbacks, including the role of distant 
market actors and distal drivers!”*. These broad, multi-layered ‘people 
and nature’ approaches hold considerable appeal, but the inherent 
complexity of local contexts can make them challenging to concep- 
tualize, implement and measure in joined-up and consistent ways’”*. 


Acting together and acting now 

The three approaches to the conservation and governance of tropi- 
cal ecosystems outlined above are often associated with alternative 
researcher and practitioner worldviews!?”!5°, However, the ecological 
diversity (Fig. 2a), vulnerability (Figs. 2b, 3) and socio-economic com- 
plexity (Fig. 5) of the tropics highlights the importance of pluralism'*! 
and the need to adopt a variety of what are often complementary and 
synergistic approaches!*°. For all their deficiencies, protected areas are 
indispensable to limit the effect of local stressors, and it will be impos- 
sible to avoid further biodiversity loss unless they are strengthened 
and expanded'**. However, conservation strategies must also address 
the underlying drivers of environmental change (Fig. 5) and avoid 
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exacerbating deeply rooted inequalities!!*. Practice is always messier 
than theory, and the adoption of more sustainable management systems 
is usually only possible with the support of a range of actors, as can be 
seen in the recent successes of some hybrid governance approaches, 
with government, the private sector and civil society organizations all 
having vital roles!*°. 

Another clear message is that conservation efforts need to operate 
at local, regional and global scales to be effective. Many distal drivers 
are disconnected in both space and time from the sites they affect, and 
the engagement of external actors—including in distant markets and 
governance processes—is often essential to ensure that local efforts are 
effective. These include more strategic integration of environmental 
policy with development goals!*4, the need for multinational environ- 
mental governance approaches, especially for aquatic systems®, and 
recognition of the importance of tackling demand for unsustainable 
products from downstream buyers and investors!!®. The capstone of 
such efforts lies in the urgent need to deliver on the Paris Agreement, 
without which climate change will undercut or even negate hard- 
won local conservation successes, whether in coral reefs”” or tropical 
forests'™. 

Finally, we need to act now to address the pressing environmental 
challenges facing the tropics. This means being adaptive, learning by 
doing and embracing innovation. The past decades have seen a boom 
in proposals, innovations and insights about the governance and 
management of tropical ecosystems, ranging from more technocen- 
tric proposals to facilitate the evolution of climate-tolerant corals!*°; 
ecological engineering to recover lost trophic interactions by species 
re-introductions, ecological replacements and rewilding!>*, to radical 
new legal frameworks such as France’s ‘Loi de vigilance’ (2017-399; 
https://www.legifrance.gouv.fr/eli/loi/2017/3/27/2017-399/jo/texte) 
that places an unprecedented due diligence obligation on major compa- 
nies to assess social and environmental risks in their supply chains that 
extend beyond French borders. Though these innovations serve differ- 
ent purposes and are varyingly scalable, they illustrate the potential of 
solutions-based science and conservation. Of course, acting now does 
not mean ignoring the existing evidence base or making uninformed 
decisions. Rather, it is vital that researchers and decision makers are 
vigilant to opportunities and risks and are willing to learn lessons. 


Keeping pace with the Anthropocene 

All approaches to governing tropical ecosystems will be more effective 
if they have local support and are based on strong scientific evidence 
that ensures, for example, that protected areas are located where they 
are most needed, ecosystem services are accurately quantified, extrac- 
tive activities such as fishing and logging are managed sustainably, and 
underlying drivers of environmental degradation are identified and 
understood. Although these challenges are common to all conservation 
and sustainability science, they are magnified in the tropics owing to 
the unique diversity and high vulnerability of tropical ecosystems and 
the low research capacity of most tropical countries. Here we examine 
four areas in which research effort can be more closely aligned with 
some of the priorities highlighted by this review. 


Addressing key knowledge shortfalls 

Our understanding of tropical biodiversity is limited by substantial 
shortfalls in knowledge regarding taxonomy and species distribu- 
tions’*”. Overcoming these shortfalls will require targeting resources 
towards the data-scarce regions that cover so much of the tropics’®. At 
the ecosystem level, there is a need for increased study of structurally 
and functionally distinct systems, particularly tropical grassy biomes®, 
dry forests'** and low-order stream systems'*”. Progress in these areas 
is likely to be aided by advances in DNA sequencing and informatics, 
which have the potential to invigorate taxonomic discovery, and by 
reaching across cultural divides to incorporate national, regional and 
local knowledge that often remains ignored because it is not available 
in English™®, included in standard databases"! or recognized by con- 
ventional science!” 


REVIEW 


Understanding vulnerability 

Our growing knowledge of the role of individual stressors, such as land- 
scape configuration or overexploitation, needs to be complemented 
by research on the effect of multiple stressors**, which could help 
predict and mitigate complex biotic responses when climate and local 
stressors act in concert (Fig. 3). Other phenomena that are important 
but harder to study include the role of time lags or extinction debts”, 
trophic cascades"! and trajectories of ecosystem degradation and 
recovery in the face of unprecedented environmental change’. 
Revealing these more-complex forms of vulnerability will often 
demand longer-term and larger multi-scale sampling and monitoring 
programs. New approaches are also needed to overcome one of the 
more intractable challenges of tropical ecology: the fact that we often 
know least about the rarest and most vulnerable species or taxonomic 
groups. 


Understanding distal drivers 

Conservation does not occur in a vacuum, and local interventions 
are likely to be much more effective if they are guided by a closer 
understanding of underlying distal drivers of biodiversity loss and 
environmental change, including identifying the actors behind such 
drivers, which will help to determine potential trigger points and 
identify more effective policy responses”’. Unpicking the role of distal 
drivers is essential to understand how interactions between social and 
environmental systems shape local environmental outcomes!“*. Careful 
study has revealed many surprising interactions, such as links between 
the intensification of commercial fishing and increased bushmeat 
exploitation in west Africa’, the role of warfare in driving African 
mammal declines’ or the role of currency exchange rates in driving 
deforestation’’’. Achieving this deeper understanding requires greater 
integration of the natural and social sciences, with interdisciplinarity 
included as a core element of tropical conservation research'“8, 


From research to impact 

Achieving positive effects from conservation research relies on building 
a stronger interface between science and society that challenges the 
oversimplified assumption of a linear flow from knowledge to action’. 
Engendering positive changes will require closer participation of 
practitioners in the research process and investments in outreach 
activities and professional capacity building!”’. These will be sup- 
ported by studying the knowledge exchange process itself, including 
the critical part that is played by knowledge brokers and boundary 
organizations!°°-!*”, Part of this process will require a focus on success 
stories or ‘bright spots, which will enable the social, institutional and 
environmental conditions that create positive outcomes to be identified 
and replicated'*!. The positive social and ecological outcomes from 
innovative restoration and rewilding programmes in Costa Rica and 
Mozambique demonstrate the potential for positive action’. 

Local managers and scientists have a vital role in designing and 
implementing research that can inform regionally appropriate conser- 
vation actions!™, At present, our knowledge of hyperdiverse ecosystems 
is over-reliant on inferences gleaned from distant research stations or 
inappropriate theoretical constructs developed for temperate con- 
texts'®!°°_ Research is also more likely to have an effect if the spatial 
scale of studies is closely matched to the administrative scale at which 
resource decisions are taken’*°. Sustaining research programmes and 
learning networks in study landscapes can help build vital relationships 
between researchers, local knowledge holders and decision makers!**, 

Achieving these changes requires building on trends in the tech- 
nological, disciplinary and cultural dimensions of research practice. 
In the technological domain, opportunities for data collection have 
been revolutionized by developments in remote sensing and drones!*”, 
the plummeting costs of DNA technologies'*® and the step-changes in 
bioinformatics that have enabled ‘big data to be stored and retrieved in 
open-access platforms!»’. In the disciplinary domain, the past decade 
has seen a marked upward trend in interdisciplinary and transdisci- 
plinary research and a greater—albeit still insufficient—integration of 
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natural and social sciences. This has resulted in an increasing openness 
of researchers towards methodological pluralism and mixed-method 
approaches!” and a growing recognition of the contribution that can 
be made by local people and citizen- and para-scientists in biodiversity 
research'®°. Changes in research culture include the greater interna- 
tionalization of ecological science and closer approximation with soci- 
ety!?, both of which can help foster more fertile ground for knowledge 
exchange and capacity building. Notable advances include the devel- 
opment of multi-disciplinary and multinational learning networks’, 
exponential growth in author teams’ and major syntheses such as the 
Intergovernmental Platform for Biodiversity and Ecosystem Services. 

Recent years have seen an awakening of environmental conscious- 
ness and calls for decisive action that are manifest, for example, in 
the Paris Agreement, the Sustainable Development Goals and volun- 
tary Zero Deforestation Commitments. Scientists from tropical and 
non-tropical regions can inform these endeavours by developing a 
reliable knowledge base and innovative management interventions. 
Overcoming the remaining research challenges is far from trivial and 
will require a massive investment of resources to develop scientific 
infrastructure and capacity within tropical nations, as well as profound 
changes to ways of working and the relationship between the research 
process and society at large. But a failure to act decisively and to act 
now will greatly increase the risk of unprecedented and irrevocable 
biodiversity loss in the hyperdiverse tropics. 
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The tropical forest carbon cycle and 


climate change 


Edward T. A. Mitchard!* 


Tropical forests make an approximately neutral contribution to the global carbon cycle, with intact and recovering forests 
taking in as much carbon as is released through deforestation and degradation. In the near future, tropical forests are 
likely to become a carbon source, owing to continued forest loss and the effect of climate change on the ability of the 
remaining forests to capture excess atmospheric carbon dioxide. This will make it harder to limit global warming to 
below 2 °C. Encouragingly, recent international agreements commit to halting deforestation and degradation, but a lack 
of fundamental data for use in monitoring and model design makes policy action difficult. 


carbon concentrations of the industrial age. They have acted both 

as a very important source of emissions, as a result of logging and 
burning, and also asa carbon sink, as the remaining forests have taken in 
much of the extra carbon added to the atmosphere. As an example, from 
1960 to 2015, anthropogenic emissions totalled 408 Pg carbon (Pg C), 
of which 80% arose from burning fossil fuels and the production of 
cement, and 20% from changes in (largely tropical) land use’. However, 
the atmospheric CO; stock grew by ‘only’ 180 Pg C over that period”, 
meaning that 55% of these emissions were taken in by the Earth system, 
reducing the magnitude of the induced climate change. This sink has 
three main components—the oceans, Northern Hemisphere forests and 
tropical forests'—and although there are uncertainties around the relative 
contributions of each, between a quarter and a third is normally attributed 
to the enhanced growth of trees in tropical forests*~°. Understanding the 
size and the causes of this sink is crucial for predicting its evolution over 
the coming century: the tropical land sink is known to be very variable 
year to year’”-!°, and reverses to become a source in hotter years!’. This 
suggests there is a real risk that, over the coming decades under climate 
change, it will become a major source of carbon every year. 

The tropical land sink is the least certain major component of the global 
carbon budget. There are various possible ways of estimating its size 
(Box 1), but none estimate the sink directly, and all have high uncertainty 
due to either sparse sampling®!?~!* or coarse resolution”®!!. As a result, 
the main way in which the land sink has been estimated is as the residual 
of the sum of all other components of the global carbon cycle’; however, 
with this method it is not possible to estimate the relative contributions 
of the Northern Hemisphere forests and the tropical forests to the sink. 
In addition, some other components of the global carbon cycle are also 
very uncertain and variable, such as the land-use change flux’’”, making 
the accurate estimation of trends in the sink very difficult. This uncer- 
tainty greatly limits the development and testing of theories and models, 
and thus means that there is a wide divergence of predictions as to how 
the sink will change under different climate-change scenarios and policy 
interventions. 

Considering all sources of evidence, it appears probable that as the 
intact forest sink declines in size, tropical forests are in the process of 
switching from being approximately neutral to being a net source of car- 
bon!+!519. This decline is caused by the combination of a decrease in the 
area of intact forest”?! as well as an increase in temperatures and drought, 
which reduces the ability of trees to respond to higher CO2 concentrations 


T ropical forests have had a critical role in the changing atmospheric 


by growing faster'>!*. With both forest loss and climate change likely to 
accelerate over the 21st century, tropical forests are likely to release ever 
more carbon, which will make limiting global warming to less than 2 °C 
above pre-industrial levels very difficult?””*. 


The carbon balance of tropical forests 

Living tropical trees store 200-300 Pg C (refs %’4-*6), about a third as 
much as is held in the atmosphere’. This stock is very dynamic: trop- 
ical trees perform about 60% of the world’s photosynthesis, capturing 
around 72 Pg C from the atmosphere every year”’, but also release a 
similar amount back to the atmosphere through respiration of both the 
plants themselves and the animals, microorganisms and fungi of the 
ecosystem'*”8, Given these large fluxes, a small proportional change in 
either the uptake or the release of CO} can result in a large net source 
or sink. There are several lines of evidence that, over at least the past 
50 years, these two processes have been out of balance, with tropical 
vegetation increasing in biomass by more than 2 Pg C yr~', equivalent 
to about 1% per year 192930, However, it is clear that the interannual 
variability of this sink is very high, driven by fluctuations in tempera- 
ture and rainfall’7?)*”. 

The tropics are also the main nexus of global land-use change, with 
deforestation and forest degradation (where some trees are removed 
but the area retains sufficient trees to be classed as a forest) releasing 
somewhere between 0.5 and 3.5 Pg C yr! (refs ®7!39-8). The wide 
range of estimates can be partly attributed to differences in time period, 
but is mostly caused by differing definitions and included processes, 
different methods (Box 1) and wide uncertainty bounds. 

The comparison of different methods shows a consensus that the over- 
all carbon balance of the tropics has been approximately neutral since at 
least the 1990s, with sinks in intact and regrowing forests equal in mag- 
nitude to sources from deforestation and forest degradation>®* (Fig. 1). 
However, it is also clear that in abnormally hot years, such as during 
strong El Nifio events, the tropics become a major net source”! (Fig. 1d). 

The following section examines the current magnitude of the major 
carbon sources and sinks. Next, the evidence for trends in the sources and 
sinks over recent decades is considered, along with their probable future 
pathways, and whether international policy can change these trends. 


Carbon sources 
Deforestation is easy to map using optical satellite data. Free Landsat 
satellite data enable most countries to produce their own maps”; 
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Box | 
Methods used to assess the tropical 
forest carbon balance 


Forest inventory plots. Marked areas of forest where tree 
diameters are measured and species recorded, enabling 
estimation of tree mass®°. Revisiting networks of such plots 
approximately every five years gives precise estimates of how 
forest carbon stocks are changing!?!+5”, although uncertainties 
are increased because plots are rare and unevenly distributed, 
with some forest types undersampled. There are also plots 

that are intensively monitored to give insight into the detail of 
carbon allocation and use efficiency®, and rare experimental 
manipulations that test the response of trees to conditions that do 
not naturally exist”>9!. 

Atmospheric inversions. There is a sparse network of 

towers and marine measurement sites across the tropics 

that permanently collect greenhouse gas concentration and 
micrometeorological data. These are supplemented by ship and 
aircraft data, and combined with atmospheric transport models to 
estimate the net flows of COz into or out of the atmosphere at a 
broad, regional scale>1!516, 

Satellites. Can be used to estimate: 


(1) Forest area. Landsat satellites have been used to produce 
consistent estimates of forest cover change since the early 
1970s. Many countries produce their own maps, and global 
30-m resolution forest change data are available from 2000 
onwards”°. However, loss data are much more reliable than gain 
data, and relating the area-based data to changes in carbon stock 
is difficult. 

(2) Carbon stocks. A unique LiDAR satellite operating in the 
mid-2000s collected distributed estimates of tree height in 
70-m footprints, which were combined with field plots and 
other satellite data to make medium-resolution (500 m-1 km 
pixels) carbon stock maps?*5, albeit with large uncertainties*°. 
These maps enable estimates of emissions when combined 
with forest area change data®®, or when produced annually®. It 
is also possible to estimate carbon stock changes using passive 
microwave remote sensing?’, however the resolution (1-2 times 
coarser) makes it impossible to separate gain and loss fluxes. 
(3) GHG concentration. Satellites can measure the greenhouse 
gas concentrations in narrow columns of the atmosphere with a 
precision of approximately 1 p.p.m. CO2z. These measurements 
have been used to observe directly the carbon entering and 
leaving tropical forests, giving information about the size of the 
tropical forest sink and its reaction to droughts at a continental 
scale”!!, However, cloud cover means that the observations are 
sparse in time and space, and the coarse resolution once again 
means forest loss and gain fluxes cannot be distinguished. 
Modelling. Given the difficulty in directly observing forest 
responses to increasing CO2 concentrations and climate changes, 
dynamic vegetation models are used directly to predict these 
responses®!8, The latest generation of models, Earth system 
models (ESMs), include many more processes and feedbacks 
than do traditional atmosphere—ocean general circulation models 
(AOGCMs), which increases their predictive power’!. Models 
provide information on processes or time periods for which we 
have no other data, and enable us to use our current knowledge 
about the Earth system to predict the future under specific 
scenarios of climate and land-use change. 


large-scale independent maps are also available and are broadly 
consistent””*?. Deforestation affects very large areas: about 100 Mha 
was deforested in the tropics from 2000-2012; of this, about 50% was in 
Latin America, 30% in southeast Asia and 20% in Africa (ref. 7°, using 
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a forest definition of greater than 25% canopy cover; similar values are 
found in other studies”)*). The main drivers of this deforestation differ 
by location: mining and large-scale commercial agriculture or pasture 
dominate in Latin America; palm oil, pulp and paper plantations 
in southeast Asia; whereas in Africa, deforestation has been largely 
driven by smallholder agriculture, with major contributions only more 
recently from mining, commercial agriculture and plantations*?. 

Estimating the amount of carbon released from deforestation is more 
difficult than assessing its spatial extent. Estimates are often produced 
by simply multiplying the area deforested by a single carbon density 
per unit area value. The result is therefore very sensitive to that single 
value, which is usually the mean carbon density from a number of local 
forest inventory plots; however, to be accurate they must be numer- 
ous and representative of the type of forest removed. At a pantropical 
scale, recent studies have improved on this by overlaying the deforest- 
ation data on continuous maps of carbon density*333°3*, However, 
errors can arise from the carbon data having a coarser resolution than 
the deforestation data, and the carbon maps having potentially large 
regional biases*°“°. Overcoming these issues, there has been some con- 
sensus in recent years that the flux from gross tropical deforestation in 
the 2000s was 0.6-0.8 Pg C yr“ (refs **°*9). 

By contrast, it is much harder to estimate the area affected and the car- 
bon losses caused by forest degradation*"”. This is partly because deg- 
radation is caused by a wide variety of processes with different effects, 
including commercial logging, fuelwood extraction, sub-canopy culti- 
vation, grazing, fire, and edge effects caused by nearby deforestation*. 
Further, it is because the only remote sensing methods that are sensi- 
tive to degradation have coarse resolution, with each pixel containing 
between twenty hectares and thousands of hectares?”811:15163744 which 
far exceeds the size of most degradation events (less than 1 ha)‘, This 
means that estimates inevitably mix the fluxes from deforestation, forest 
degradation, regrowth of previously disturbed forest, as well as changes 
in intact forest, into a single combined change per pixel. 

Some studies have used inventory plots to estimate fluxes from 
degradation*!**. However, these give numbers on a per-hectare basis 
that are hard to scale, as we do not have maps of degradation. High- 
resolution remote sensing from light detection and ranging (LiDAR)”” 
or synthetic-aperture radar*, combined with local field biomass plots, 
can directly map the changes in carbon stock from deforestation, 
degradation and regrowth at a suitable resolution, but so far such 
studies are rare and have been used only for small areas, so cannot be 
of much help for pantropical estimates. They can, however, show the 
broad ratio between carbon losses from deforestation and degradation; 
although this varies widely in space and time, there is a suggestion that, 
at a large scale, degradation is responsible for perhaps twice the carbon 
release of deforestation, with great regional variation®». Furthermore, 
there is agreement that degradation is more important as a propor- 
tion of total emissions in Africa than in South America or southeast 
Asia30414248. 

Tropical peat forests are independently a major potential source of 
carbon. Peat is carbon-rich, partially decayed organic matter, associ- 
ated with waterlogged and acidic conditions, which exists in layers up 
to 20-m thick under tropical swamp forests. Recent large discoveries 
under the forests of the Congo”” and Amazon” basins have increased 
the known area of tropical peat by 50% to 577,000 km? (combining 
figures from refs “”>'), These peat forests have very high carbon densities, 
meaning that they have the potential to make an outsized contribution 
to the global carbon cycle: about 5% of tropical forests overlay peat, 
but they store 70-130 Pg C (ref. 4”), which is considerable compared 
to the 200-300 Pg C stored in all tropical trees®”*+-7°. The majority of 
tropical peat is in southeast Asia, but it has been extensively cleared 
and drained in recent decades (over half the area present in 1990 had 
been deforested or degraded by 2008°*°%), therefore contributing con- 
siderably to land-use change emissions by releasing 0.3-0.54 Pg C yr! 
(refs 41”). This large flux is included in the land-use change numbers in 
Fig. 1, but is excluded from normal values of the deforestation flux (for 
example, refs 8,33,36,38) as such methods exclude belowground carbon. 
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Fig. 1 | Tropical forest carbon fluxes assessed using different methods. 
Annual fluxes (in Pg C yr~') into and out of tropical forests for different 
overlapping time periods (a-c) and for a recent El Nifio year (d). The net 
intact forest flux is shown in turquoise, the net flux in regrowing forest is 
in orange, and the deforestation and forest degradation flux (including 
fire) is shown in pink. Panels a-c show that there is broad agreement 
that the tropics have made an approximately neutral contribution to 
atmospheric carbon stocks in the recent past, but d shows that intact forest 
can become a carbon source in hot and dry years, leading to considerable 
net emissions from the tropics. Data in a are from networks of forest 
inventory plots®, combined with forest area data from country surveys 


Furthermore, intact, degraded and drained peatlands in southeast Asia 
have been subject to fires in El Nifo years that have released much 
larger quantities of carbon: up to 2.5 Pg C ina single year, sufficient to 
cause noticeable anomalies in the atmospheric CO) growth rates'>**. 
By contrast, the peatlands of the Congo and Amazon basins were until 
recently largely undisturbed, and so are not currently thought to con- 
tribute considerably to the land-use change flux”. 

However, even undisturbed peatlands are probably losing carbon 
owing to climate change*?!*. This is hard to monitor because satellites 
can see only the trees, but the vast majority of the carbon in peat-forest 
ecosystems is instead stored belowground as peat. Gruelling fieldwork 
to ascertain peat depths and extract cores for chemical analysis is nec- 
essary to ascertain carbon stocks, but these point estimates are hard to 
scale to large areas owing to great spatial variability’. The tracking 
of losses is further complicated because of the range of mechanisms 
through which peats can lose carbon: respiration in peats releases CHy 
as well as CO2; burning releases CO and C in addition to CO,; and dis- 
solved and particulate organic carbon is washed away in rivers. There is 
some data on non-CO, emissions: both satellite and modelling datasets 
suggest that all tropical peatlands are substantial methane sources”, 
and field data suggests that fluvial organic carbon transport is con- 
siderable for both intact and disturbed peats in southeast Asia, and 
has increased by more than 30% from 1990 to 2008°". Although more 
baseline data are needed, it seems likely that warming and droughts 
caused by climate change are resulting in peat forests becoming net 
sources of carbon*!?5}°?, 


Carbon sinks 
The remeasurements of millions of trees in networks of forest inventory 
plots across the undisturbed forests of Latin America, Africa and south- 
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or ref. *®. In b, values are obtained from annual 463-m resolution optical 
satellite data®, calibrated using LiDAR data and field plots from the 
mid-2000s. Intact and regrowth fluxes are not separated in this method. 
The figures in the study have been grossed up from biomass to total 
carbon stock change (that is, including dead wood, litter, soil) using the 
data in table 2 of ref. ° (adding 16%). Data in c are derived from looking 
for overlap between atmospheric inversion, modelling and field-plot 
estimates”. In d, data are obtained from satellites sensitive to atmospheric 
CO) concentrations for the 2015 El Nifio year'!, which contrast sharply 
with the other estimates shown. Change in land use could not be divided 
into separate regrowth and loss fluxes in this method. 


east Asia suggest that these forests have all been gaining carbon at a simi- 
lar rate of around 0.5 Mg C ha~' yr“! since at least the late 1970s, adding 
up to a total sink of a little over 1 Pg C yr“! (refs &!4978), Although it 
has been suggested that artefacts in plot remeasurements could lead to 
erroneous findings of increasing carbon storage with time”, there is also 
considerable evidence of a sink of around this magnitude from inde- 
pendent methods, such as atmospheric inversion studies*'*!°, satellite 
data®*” and models’, so there is little doubt that it exists. 

Regrowing and disturbed forests are also clearly taking in carbon 
from the atmosphere, but as with forest degradation, there is little relia- 
ble data on the magnitude of this sink. Studies that track individual field 
plots show great variation: after total clearance there was no increase 
in biomass at all after 20 years at a site in Uganda, but an increase of 
over 10 Mg Cha7' yr7! throughout the first 10 years in moist sites in 
Latin America’?. A meta-analysis of 1,468 plots in 45 sites found average 
recovery rates of 3.05 Mg Cha _'yr7" for the first 20 years, and that sites 
regained 90% of old-growth biomass values after a median of 66 years’ 
(though biodiversity does not recover in these timescales!?**®°). As we 
do not have good maps of past disturbance it is hard to convert plot values 
into tropical estimates, but these data are consistent with inversion studies 
and satellite observations of a current flux with a similar magnitude to the 
intact forest sink (that is, 1 Pg C yr~'), with large uncertainty*©. 


Trends in the sources and sinks 

Despite the uncertainties related to individual components, we do have 
a reasonable understanding of the carbon balance of the tropics in the 
recent past, with various methods agreeing that the tropics make an 
approximately neutral contribution to the global carbon budget*?*?7 
(Fig. 1). However, we are much less certain about how the system is 
changing. 
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Fig. 2 | Contradiction between the main datasets of forest area change. 
Satellite datasets and nationally reported statistics are in agreement 

about the rate of net tropical forest loss in the early 2000s, but diverge 
increasingly over time. The pink points and trendline (quadratic ordinary 
least squares) show net annual forest loss from a systematic global 

satellite analysis from the University of Maryland (UMD), Version 1.4. 
Forest gain is not assigned to a particular year in this dataset, so is here 
distributed equally across the time period to give net figures. A forest 
definition of 10% canopy cover in 2000 was used. The green points and 
trendline (linear ordinary least squares) show net annual change in forest 
area across tropical countries from the FAO Global Forest Resources 
Assessment 201548, as summarized in ref. 7!. Forest area is reported at five- 
year intervals, the change has been calculated between each interval and 
then divided by five to give annual data. Various forest definitions are used 
by countries when producing these figures, with canopy cover ranging 
from 10-30%. This means that the total area of forest considered for 2000 
is higher in the UMD dataset, and would be expected to lead consistently 
to slightly higher deforestation for the UMD dataset compared to the FAO 
dataset. However, this difference in forest definition cannot explain the 
differences in trend, as only around 5% of losses in the UMD dataset are in 
forest with canopy cover of between 10 and 30%. 


Although there is general agreement that total forest area is shrinking 
across the tropics, there is considerable controversy as to whether the 
rate of loss is increasing or decreasing. Official figures from the Food 
and Agriculture Organization (FAO; collated from national statistics) 
show a decline in annual net forest loss rates since 20007!“8, whereas 
satellite-based data show an increase in the loss rate”” (Fig. 2). Some of 
this difference can be explained by differing definitions of forest and 
the precise area compared, but the difference in trend is too large to be 
explained by these alone. It has long been known that FAO statistics are 
not ideal for analysing trends*’: although some tropical countries prob- 
ably produce very good data, their monitoring capacity is variable™. 
As an example, 14 African countries have reported exactly the same 
annual change in forest area every year from 1990-2015*8, even though 
other datasets see considerable changes in their rates of loss over time**. 

On balance, the evidence from the remote-sensing data sources 
appears to be more reliable than the FAO data. For example, the 
satellite-derived product from Hansen et al.”” is produced consistently 
across the tropics, detects correct country-level trends where we have 
good alternative sources of data (for example, correctly seeing the rapid 
reduction in deforestation in Brazil® and the recent rapid acceleration 
of loss in the Democratic Republic of the Congo™), and matches well 
to detailed high-resolution data in a study that compares areas with 
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different patterns of forest loss. It is therefore likely that the rate of 
deforestation in the tropics is increasing. 

Over the coming decades, as the global demand for agricultural, timber 
and mineral commodities, as well as local population density, con- 
tinue to grow, it seems likely that the rate of forest loss will continue to 
increase***?*, Current areas that are largely undisturbed owing to inac- 
cessibility, such as the peat forests of the western Amazon and central 
Congo basins, are likely to become accessible and suffer deforestation®. 
Eventually the rate of forest loss will stabilize and start to fall, partly 
because the area of remaining unprotected forest will have greatly 
decreased, but also because once countries reach a sufficient level of eco- 
nomic development and forest loss, policy and civil society drivers result 
in the remaining forest area stabilizing or even increasing. However, 
this point may come only once most of the forest has been lost, and even 
when countries reach this point (such as Vietnam, China or much of the 
developed world), as their economies demand increasing levels of com- 
modities, they themselves will export deforestation to less economically 
developed countries”. The case of Brazil makes an interesting case study 
here: it greatly reduced its rate of forest loss from 2005-2014”, owing to 
reductions in global commodity prices and policy interventions”, but 
the rate has since increased again and could climb faster as the global 
demand for agricultural and mineral commodities increases, and laws 
promote development and not forest protection®®’. 

Forest degradation is difficult to map and monitor: as discussed pre- 
viously, there is little hard evidence of its overall current magnitude, 
let alone trends, although we suspect it involves a much larger area 
than deforestation each year*“*#°, Normally degradation appears to be 
closely associated with deforestation“*, and it is reasonable to assume 
in the future that, as the area of accessible forest increases owing to 
fragmentation caused by deforestation and road building, the area of 
forest degraded each year will also increase. About 20% of all tropical 
forest is now within 100 m ofan edge, with 84% of these edges being 
anthropogenic, and this proportion will continue to increase as more 
anthropogenic edges are created each year than are closed up*’. The 
effect of commercial logging, a major direct cause of degradation but 
also a driver of increased fragmentation and access roads, also seems 
likely to increase as ever more logging concessions are granted*)*”, 
Increases in fire due to climate change may result in increased degra- 
dation as well as increased fragmentation over time*”. Overall it is hard 
to believe that the area of forest degraded each year is not increasing, 
nor that it will stop increasing in the near future. 

As the area of degraded forest increases, so does the area of forest 
with the potential for regrowth; as such, the proportion of the forest 
sink that comes from previously disturbed forest is likely to increase 
with time’. Disturbed forest normally takes up carbon much faster per 
hectare than does undisturbed forest, though with high variability’. 
Under climate change, the rate of growth that could be achieved by 
disturbed forest could increase further, owing to CO, fertilization!®.”°. 
However, other factors (such as increased temperature and changing 
precipitation) could negate this effect: a specific modelling test as to 
whether land-use change increased the land sink under an extreme CO 
scenario found that only one out of four models predicted an increased 
sink, with the others showing no increased sink”. Fundamentally, the 
size of the sink from regrowing forest is very hard to model using cur- 
rent knowledge, with a high level of divergence between models”’. 
There is separate evidence that normally, once deforested, land is ulti- 
mately permanently converted to agriculture, pasture or settlements, 
and most degraded land is itself ultimately deforested”. This pattern 
is unlikely to change as the global economy and population continue 
to grow, so ultimately carbon captured in the regrowing sink may not 
remain captured for long. 

It is also difficult to predict the effect that climate change will have on 
the intact forest sink, because we know that climate change will bring 
opposing pressures® (Fig. 3). Theory and modelling studies generally 
agree that the most likely cause of the sink is CO; fertilization: as atmos- 
pheric CO, concentrations have increased from around 280 p.p.m. in 
1850 to 410 p.p.m. in mid-2018, fixing carbon through photosynthesis 
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Fig. 3 | The effects of climate and land-use change on the intact forest 
carbon sink. The potential contrasting effects of climate change and 
different land-use change trajectories on the size of the intact forest 
carbon sink. Arrows pointing up show how climate change and policy 


has become easier, owing to increased CO, concentrations in leaves for 
a given level of stomatal opening (itself limited by water availability)’. 
This effect should continue as CO, levels increase*’*; however, climate 
change will also raise temperatures, which will increase soil and plant 
respiration rates, and droughts and fires will also likely increase in 
both frequency and severity, directly killing trees (Fig. 3). In addition, 
deforestation and degradation will continue to reduce the area of intact 
forest that can act as a sink. Many studies therefore suggest that climate 
change could lead to a reduction in the strength of the carbon sink, and 
ultimately to its reversal into a carbon source!!”*-”°. There is evidence 
from networks of field plots that this is already happening, with the 
magnitude of the sink decreasing over time’>!*. 

However, models do not generally predict a reduction in the mag- 
nitude of the land sink, with many predicting that CO, fertilization 
will offset the negative influence of climate change on ecosystem res- 
piration and tree mortality?!®”°”°, For example, six coupled climate 
models that were run under the same CO, growth scenario predicted 
changes in tropical land carbon storage between 1960 and 2099 that 
ranged from —11 Pg C to +319 Pg C, with a mean of +172 Pg C 
(ref. °). The differences between these models are mostly attributed to 
differences in the sensitivity of tropical vegetation to temperature, and 
the extent to which the temperature increases owing to non-CO, forc- 
ings (for example, a reduction in aerosol concentrations, or increases 
in other greenhouse gases), which do not have an associated positive 
CO,-fertilization effect. Owing to the variability in model predictions 
of the current size of the intact forest carbon sink””!””78, and the lack 
of critical factors in the models, such as the mortality of large trees 
caused by droughts’”*, it is difficult to use model-based predictions 
for predicting trends in the forest sink. Therefore, the best evidence is 
from field plots'*" and satellites”"!, which show that the intact forest 
sink is weakening and that it becomes a source in unusually hot years. 
This suggests that it will probably reverse to become a carbon source 
under climate change. 

There is so little data on the carbon balance of intact peat forests that 
it is hard to speculate on how they are changing with confidence. But 
it is likely that increasing temperature and variable precipitation has 
increased the rate of carbon loss, especially when combined with draining 
and other disturbance, and that such losses are likely to accelerate in the 
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could increase the magnitude of the sink, whereas arrows pointing down 
show how it will be reduced. All processes will occur to some extent, so 
predicting how the sink size will change is very difficult. 


future*®>?>35, However, there are high uncertainties, and an increase 
in basic observations of peat forests are urgently needed. 


Modelling the future of tropical carbon 

From recent trends, it appears likely that the current major sources 
of emissions (deforestation and degradation of forests, including peat 
forests) will at least stay stable or probably increase over the coming 
decades**3**, whereas the current sinks (from intact and regrowing 
forest) will probably reduce, and could reverse and become 
sources'!’’*-”°. Therefore, it is very likely that tropical forests will 
become a net source of CO to the atmosphere in the near future, if 
they have not already*. However, estimating the size of this tropical 
source over time is very difficult, even when considering a specific 
scenario of land use and climate change, owing to complex feedbacks 
and interactions between different elements of the carbon cycle, climate 
change, people, policy and the global economy. 

The climate modelling community has produced ever more complex 
models that include the complex feedbacks between land-use change, 
climate change and intact forest’!. A noticeable difference between the 
4th and 5th Assessment Report of the International Panel on Climate 
Change (IPCC) is that the latter uses Earth system models (ESMs) for 
much of its predictions, rather than atmosphere-ocean general circula- 
tion models (AOGCMs)'®. ESMs include all the processes of AOGCMs, 
but add representations of biogeochemical cycles—including the full 
carbon cycle—and couple these cycles with other components that 
allow for feedbacks. For example, deforestation in an AOGCM sim- 
ulation will increase the atmospheric CO2 concentration and change 
the physical properties of the ground surface, but only in an ESM will 
the smoke and dust released from deforestation, and their subsequent 
effect on atmospheric chemistry and the rate of photosynthesis of the 
remaining trees, be modelled”’. 

To standardize the inputs for modelling climate change under 
different scenarios to 2100, the IPCC developed four representative 
concentration pathways (RCPs) for its 5th Assessment Report)’. 
These are trajectories of atmospheric greenhouse gas concentration 
and consequent radiative forcing (amount of solar energy absorbed 
by the Earth), and are named after the prescribed radiative forcing 
in the year 2100 relative to preindustrial levels in W m7: RCP2.6, 
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RCP4.5, RCP6 and RCP8.5. They are based on underlying assump- 
tions about social and technological development, and the extent to 
which climate mitigation activities take place: RCP8.5 assumes that 
annual fossil-fuel emissions increase rapidly until around 2070 before 
eventually stabilizing, whereas RCP2.6 assumes that emissions peak 
by 2020 and then decrease rapidly”. Ideally it would be possible to 
provide confident predictions of the size of the forest sinks under these 
different scenarios, but unfortunately there is still high variability in 
the predictions of the tropical carbon cycle using ESMs, and these 
predictions cannot agree as to whether the tropical land surface will 
gain or lose carbon overall under the different scenarios”’’. Much of 
the uncertainty (around 80%) in tropical land surface prediction is 
caused not by uncertainty in the scenarios but by differences in model 
structure”’; in particular, differences in the predicted effects of specific 
climate parameters and CO, concentration on net primary productivity 
and vegetation turnover (including structural shifts, wildfires and 
mortality)®”. It is therefore urgent that we improve our knowledge of 
how the components of the tropical carbon cycle function, to better 
design and test such models. 


Effect of policy on tropical forests 

The extreme RCPs (RCP2.6 and RCP8.5) assume similar levels of con- 
version of tropical forest to agriculture”’, with the differences aris- 
ing largely from the degree of fossil fuel burning. However, in reality, 
considerable developments in national and international policy 
since the last IPCC report in 2013 have made the reduction of trop- 
ical deforestation and degradation, and the restoration of previously 
degraded and deforested tropical land, a key pillar of reducing cli- 
mate change. This is sensible, because unless tropical deforestation 
and degradation is reversed, the task of halting the rise in atmospheric 
CO, concentrations would involve decarbonizing the global economy 
at what would probably be an unfeasible rate?*?3, and it offers the 
possibility of a different path for the tropical carbon cycle rather than 
continuing current trends. 

The Paris Agreement of 2015, which entered into force in 2016 and is 
now ratified by 178 of the 197 countries of the United Nations Framework 
Convention on Climate Change, aims to limit increases in global 
average temperature to “well below 2 °C above pre-industrial levels”. 
It does not specify how this should be reached, but includes a strong 
statement in Article 5 that countries “should take action to conserve 
and enhance ... forests”*!. To assist developing countries with meeting 
Article 5, it “encourages” all countries to engage in REDD+ (defined as 
‘reducing emissions from deforestation and forest degradation, and the 
role of conservation, sustainable management of forests and enhance- 
ment of forest carbon stocks in developing countries’)*!. The details of 
financing and monitoring have yet to be agreed, but there is consider- 
able optimism that REDD-+ can succeed in increasing the area of for- 
est, and the proportion of it that is undisturbed, compared to business 
as usual*?®*83_ As a result of ‘REDD Readiness’ funding, most tropical 
countries have increased their capacity to monitor changes in their own 
forests”, and are submitting Forest Reference (Emission) Levels—official 
baselines against which future emissions can be compared—and plans 
for reducing emissions below these levels if funding is provided. 

Although the Paris Agreement is ambitious in overall terms, its 
proposals on forests lack concrete detail, stating only that countries 
should “take action”. However, there are other international agree- 
ments that involve many or most of the same countries and are more 
specific. For example, the New York Declaration on Forests, signed 
by 192 organizations including 40 governments in 2014**, aims to 
“at least halve the rate of loss of natural forests globally by 2020 and 
strive to end natural forest loss by 2030”. This was ambitious, but some 
believed it was achievable**. More ambitious still, the UN Sustainable 
Development Goals (SDGs)**, agreed in 2015, include as Target 15.2 
an aim to “by 2020 ... halt deforestation”. This was included not only 
because preventing climate change is one of the key aims, but also 
because healthy tropical forests are important for the achievement 
of most of the 17 goals®*. Few believe that deforestation can really be 
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stopped so fast, but these international agreements will spur at least 
some countries to enact policies to greatly reduce their deforestation 
rates?>°, 

The New York Declaration on Forests further aims to restore 150 
million hectares of currently deforested or degraded land by 2020, and 
350 million hectares by 2030. There are worries related to these ambi- 
tious targets: there is a risk that natural grasslands will be afforested, 
leading to a loss of biodiversity and potentially also soil carbon®, or 
that agriculture will be displaced by restored forest, leading to more 
deforestation elsewhere®’. Also, many countries have not committed to 
meet their goal solely through natural forest (for example, through leav- 
ing degraded land to regenerate naturally, with the greatest ecological 
and long-term carbon benefits), but instead will plant monocultures 
of exotic tree species such as teak and rubber. Nonetheless, this overall 
enthusiasm for the restoration of forests should be positive for tropical 
carbon storage (Fig. 3), and if the sites are chosen well and the restora- 
tion type carefully considered, it could be highly beneficial for people 
and the environment””*’. 

Looking further into the future, the Paris Agreement mandates that, 
by the second half of the 21st century, the remaining anthropogenic 
greenhouse gas emissions will be balanced by sinks”’. This will require 
a large programme of capturing carbon directly from the atmosphere 
and storing it elsewhere*”. As tropical trees are by far the most efficient 
carbon-capture method known, a technology known as bioenergy with 
carbon capture and storage (BECCS) is proposed, which will generate 
energy through the burning of tropical plantations and store the pro- 
duced CO, below ground”>. To meet the negative emissions targets 
needed to limit global warming to below 2 °C by 2100, models suggest 
it would need to be implemented on an enormous scale (400-800 Mha; 
for comparison, India covers 329 Mha)*?*°. This would clearly make 
the tropics a major carbon sink, but with negative consequences for 
biodiversity and ecosystem services. 

Overall, these agreements are sufficient to change current trends 
considerably, and if fully implemented would increase the forest carbon 
storage of the tropics markedly over the coming century. However, 
meeting the targets would involve drastic and coordinated action from 
people, policy makers and companies globally***?. 


Safeguarding tropical forest carbon 

The evidence suggests that unless the world makes a coordinated 
effort to change from its current course, deforestation, degradation 
and climate change will combine to make the tropics a net source of 
carbon to the atmosphere over the coming decades. This is despite 
increasing CO; levels making it easier for intact forests to photo- 
synthesize and absorb carbon®'®. However, if we were able to stop 
deforestation and forest degradation, leave currently degraded forests 
to recover, and reforest, as targeted in international agreements, then 
tropical forests would instead probably become an important carbon 
sink, contributing to the Paris Agreement goal of limiting mean global 
temperature increases to below 2 °C”*”?, Retaining and restoring these 
forests would have further immense benefits to human wellbeing, 
through maintaining their biodiversity and ecosystem services™. 
However, two interconnected problems limit the achievement of these 
goals. Our spatial information on how forests are changing is poor, 
and a lack of field experiments means that ESMs cannot predict well 
how forests will respond to different scenarios of climate and land- 
use change. 

Although we monitor deforestation well, we do not have good data 
on changes within forests. We have techniques that can observe the 
integrated carbon flux over large regions, but we have very little know1- 
edge of the size of the individual processes involved (such as degra- 
dation, regrowth, or the impact of droughts and fire). This makes it 
difficult to design and implement policy: for example, no country has 
reliable baseline figures on their rate of forest degradation™, so it is 
difficult to set targets or create policies to reduce degradation under 
REDD-4, and to receive payments even if such policies are successful. 
This also limits model development and testing. 
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New satellite missions, such as GEDI and OCO-3 (both planned for 
launch in 2018) and BIOMASS (2021), will help by producing high reso- 
lution, globally consistent maps of forest carbon stock changes for the 
first time. These will not only assist with targeting and monitoring poli- 
cies, but will also enable us to discover the magnitude of the forest sink at 
an unprecedented resolution (less than 1 km7), and how it is influenced 
by local conditions and climate fluctuations. However, these satellites 
require pantropical forest inventory and airborne LiDAR data for 
calibration and validation. REDD+ will assist directly here: already, 
considerable funding has been spent on designing and setting up 
monitoring systems and capacity in developing countries®. 
Unfortunately the data collected is rarely made available to the inter- 
national scientific community, as publishing such data is against the 
natural instincts of countries, who wish to protect their sovereignty (there 
are some exceptions: for example, field and LiDAR data from a recent 
carbon stock map of the Democratic Republic of the Congo is available 
at http://panda.maps.arcgis.com). Funders and scientists must persuade 
countries to be more open, or they will not obtain the full benefits from 
new Satellite missions. 

Better maps of forest carbon stocks will make a big difference, remov- 
ing the current wide spread of figures on the carbon fluxes from trop- 
ical forests (Figs. 1, 2), supporting REDD-+ and other policy efforts to 
reduce forest loss, and enabling the testing of ESMs and theories as to 
how tropical forests respond to climate fluctuations and disturbance 
events. However, these data will not improve our understanding of how 
forests will respond to climate and CO, conditions that do not currently 
exist, an understanding that is necessary for improving ESMs. For this 
we need field experiments, such as those that artificially drought, warm 
or increase the CO, concentration of large tropical forest plots. Such 
experiments are expensive to run and take many years to produce use- 
ful results”°, and therefore at present they are almost non-existent in the 
tropics®*. Their development should be supported by governments, as 
without them there will be no data to develop and test the critical next 
generation of ESMs’*. 
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El Nifio events are characterized by surface warming of the tropical Pacific Ocean and weakening of equatorial trade 
winds that occur every few years. Such conditions are accompanied by changes in atmospheric and oceanic circulation, 
affecting global climate, marine and terrestrial ecosystems, fisheries and human activities. The alternation of warm 
El Nifio and cold La Nifia conditions, referred to as the El Nifio-Southern Oscillation (ENSO), represents the strongest 
year-to-year fluctuation of the global climate system. Here we provide a synopsis of our current understanding of the 
spatio-temporal complexity of this important climate mode and its influence on the Earth system. 


he view on the El Nifio phenomenon—originally described in 

1893 as ‘corriente del Nifio’”!, a warm regional ocean current that 

affected climate off the coast of Peru—has changed over the past 
century. In the 1960s, ENSO was recognized as a basin-scale phenomenon 
involving coupled atmosphere-ocean processes”. A major international 
research programme in the 1980s and 1990s fundamentally advanced 
the ability to observe, understand and predict ENSO and its world-wide 
impacts*. During the past 20 years, our understanding of ENSO has 
continued to evolve, as new layers of complexity (Box 1) have been iden- 
tified in ENSO dynamics and predictability. The concept of El Nifio has 
developed from one of a canonical progression of phases from onset to 
maturity and demise? (Fig. 1) to one that accounts for its spatio-temporal 
complexity (Fig. 2) and varying climatic impacts** (Fig. 3). We have also 
come to realize that although ENSO primarily manifests itself as a year-to- 
year climate fluctuation, its dynamics involves a broad range of processes 
interacting on timescales that range from weeks”! to decades!". Here the 
diversity in patterns, amplitude and temporal evolution of this climate 
phenomenon will be referred to as ENSO complexity (Box 1). 

The most recent El Nifio event!2, in 2015/2016, was initiated in boreal 
spring by a series of westerly wind events (WWEs) (Box 1, Fig. 3e)—a 
form of tropical weather noise. The associated wind forcing triggered 
downwelling oceanic Kelvin waves (Box 1, Fig. 1c), thus reducing the 
upwelling of cold subsurface waters in the eastern Pacific cold tongue 


(Box 1) and leading to surface warming in the central and eastern Pacific. 
The positive sea surface temperature anomaly (SSTA) shifted atmospheric 
convection from the Western Pacific Warm Pool (Box 1) to the central 
equatorial Pacific, causing a reduction in equatorial trade winds, which 
in turn intensified surface warming through the positive Bjerknes feed- 
back (Box 1). The seasonally paced termination of the 2015/2016 event 
(Fig. 3e) was associated with ocean dynamics and the slow discharge of 
equatorial heat into off-equatorial regions, thus providing a delayed neg- 
ative feedback (Box 1). The event started to decline in early 2016 and 
transitioned into a weak La Nifia in mid-2016. 

In broad terms, this evolution is common to the other strong El Nifio 
events (Fig. 1), in 1982/1983 and 1997/1998 (Fig. 3c). However, no two 
events are alike—be they strong, moderate or weak (Figs. 2, 3f-m). 
This diversity arises from the varying roles of noise forcing (Fig. 3c-e) 
and of positive and negative coupled atmosphere-ocean feedback pro- 
cesses! (Box 1) that act to enhance and suppress the growth of SSTAs, 
respectively. The complexity of ENSO (Box 1), along with internal atmos- 
pheric noise, also translates into a diversity of global impacts”'*, When 
the underlying sea surface temperatures (SSTs) change in the equatorial 
Pacific, there are shifts in atmospheric deep convection, which in turn 
cause adjustments of the global Walker circulation (Box 1) and generate 
stationary atmospheric waves!> that impact the far reaches of our planet. 
This perturbed global circulation influences weather variability, leading 
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Box| 
ENSO glossary 


Bjerknes feedback. Positive ENSO feedback along the Equator, 
in which a weakened (strengthened) equatorial zonal sea 
surface temperature (SST) gradient weakens (strengthens) trade 
winds, which in turn further reduce (increase) the zonal SST 
gradient. 

Combination tones/C-mode. Enhanced spectral energy on 
timescales of 9 months and 15-18 months, generated by the 
nonlinear modulation of ENSO by the seasonal cycle, and vice versa. 
C-modes play an important part in the seasonal turnabout of El 
Nifio events. 

Eastern Pacific cold tongue. An eastern equatorial Pacific region 
characterized by wind-driven upwelling of cold subsurface waters 
(Fig. la, b). The cold tongue warms considerably during eastern 
Pacific El Nifio events and cools during La Nifla events. 

Ekman feedback. Positive (negative) SST anomalies (SSTAs) weaken 
(strengthen) the equatorial trade winds, reducing (increasing) the 
upwelling of cold subsurface water in the eastern equatorial Pacific, 
thus reinforcing the original SSTA. 

ENSO complexity. ENSO complexity expands on the concept of 
ENSO ‘pattern diversity’ to include also temporal characteristics 
(from weather, annual cycle, interannual to decadal timescales), 
dynamics, predictability and global impacts. 

ENSO skewness. Amplitude asymmetry of El Nifio and La Nifia 
events, which quantifies the fact that El Nifo events attain larger 
amplitudes than La Nifia events. Skewness is a clear indication of 
nonlinearity in the ENSO cycle. 

Equatorial Kelvin wave: Eastward-propagating oceanic 

internal wave that displaces the interface (thermocline) 

between warm surface waters and cold subsurface waters. Westerly 
(easterly) equatorial wind anomalies generate downwelling 
(upwelling) Kelvin waves, which deepen (shoal) the thermocline 

in the eastern Pacific and reduce (enhance) the efficiency of 
climatological upwelling. 

Multiplicative noise. Interaction between westerly wind events and 
underlying SST in the western and central Pacific, in which warmer 
(colder) SST favours more (fewer) westerly wind events; also 
referred to as state-dependent noise. 

Recharge/discharge. Meridional transport of heat into/out of 

an equatorial band, driven by changes in near-equatorial wind 
variations. Recharge/discharge processes have a key role in the 
initiation and termination of El Nifio events. 

Thermal damping. Typically a negative feedback arising from 
SST-induced changes in surface radiative and turbulent heat fluxes 
in the tropical Pacific. It involves tropical clouds, convection and 
atmospheric boundary layer physics. 

Thermocline feedback. Generally positive feedback operating in the 
eastern equatorial Pacific, in which a warm (cold) equatorial SSTA 
weakens (strengthens) equatorial trade winds, leading to mean 
upwelling of anomalously warm (cold) water. 

Westerly wind event: Weather systems in the western and central 
Pacific that are often associated with an abrupt relaxation of the 
equatorial trade winds, generating downwelling Kelvin waves and an 
eastward expansion of the Western Pacific Warm Pool. 

Western Pacific Warm Pool. Some of the warmest waters in 

the worlds’ oceans occur in the western tropical Pacific, with 
temperatures exceeding 28 °C (Fig. 1a, b). The Western Pacific 
Warm Pool’s seasonal north-south migrations have an important 
role in the termination of El Nifio events. 

Zonal advective feedback: Positive feedback, particularly effective 
in the central Pacific, in which a positive (negative) equatorial SSTA 
weakens (strengthens) equatorial trade winds, reducing (enhancing) 
the oceanic transport of cold waters from the eastern Pacific. 
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to massive reorganizations of tropical and extratropical temperature and 
rainfall patterns!®!” (Fig. 3f-m). 

Palaeo-climate reconstructions of the ENSO phenomenon covering 
the past ~10,000 years also show a wide range of amplitudes!®, thus 
highlighting the importance of internal climate processes in modulat- 
ing ENSO complexity on timescales ranging from decades to centuries. 
In addition, the activity of reconstructed ENSO variability shows an 
intensification in the late 20th century relative to other pre-industrial 
periods'*””, thus raising the general question of whether external forc- 
ings could influence the evolution and amplitude of ENSO. How ENSO 
responds to greenhouse warming is one of the most compelling outstand- 
ing questions”, 

Given the societal and environmental relevance of ENSO, it is para- 
mount to improve our understanding of the processes that control the 
amplitude, timing, duration, predictability and global impacts of ENSO. 
Here we assess our current understanding of ENSO dynamical processes 
and their role in controlling complexity of this fundamental climate fea- 
ture. Against this backdrop, we highlight areas of uncertainty (see ‘A uni- 
fying framework’) as a stimulus for further research. 


A conceptual view of ENSO dynamics 

Early efforts to elucidate the dynamics of ENSO focused on the average 
(composite) evolution of El Nifio events”!, capturing the typical 
evolution of ocean and atmosphere conditions from the early-spring 
initiation of El Nifo, to its winter-time peak and transition to La Nifia 
during the subsequent summer (Fig. 1). The enhanced spectral inter- 
annual variability of ENSO (Fig. 3a, b) has been explained by invoking 
positive atmosphere and ocean feedbacks and delayed negative ocean 
adjustment feedbacks (Box 1), which together can lead to oscillatory 
dynamics, as encapsulated by a variety of conceptual ENSO models”. 
Here we focus on the ENSO recharge oscillator model?? which, in its 
most general form, can be expressed as: 


AT, /dt = Ip. + Fh 


dh/dt = —eh—a¥, w) 
where T, and h represent the equatorial eastern Pacific surface temper- 
ature and zonal mean thermocline depth, respectively, and dT./dt and 
dh/dt are the corresponding time derivatives. The Bjerknes stability 
index, Ipy, (also referred to as the BJ index or ENSO linear growth 
rate; Fig. 1j) depends on a number of processes (such as thermocline, 
zonal advective and Ekman feedbacks) that reinforce SST, and on neg- 
ative feedbacks from thermal advection by horizontal mean surface 
currents and thermal damping by net surface heat fluxes!? (Box 1). 
In equation (1), ¢ represents the damping rate of thermocline depth 
anomalies. The interannual timescale of the ENSO system is mainly 
determined by F and a, which describe the thermocline feedback 
(Box 1) and the slow equatorial recharge-discharge process (Box 1) 
associated with oceanic heat transport, respectively. For a constant Ipy, 
the model describes a linear recharge oscillator: starting from neutral 
conditions with T. ~ 0 (typically in boreal winter—spring; Figs. 1c, 2) 
and a charged thermocline state, h > 0, an El Nifio event can grow 
(Figs. 1d, e, 2). While eastern equatorial Pacific SSTAs develop, the 
thermocline feedback, Fh, (Box 1) further intensifies the growth of 
the SSTAs by upwelling anomalously warm subsurface waters to the 
surface in the eastern Pacific cold tongue. Moreover, positive eastern 
Pacific SSTAs (T. > 0) cause a weakening of the equatorial trade winds 
(Fig. 1d, e). The associated wind-stress curl discharges the equatorial 
heat through Sverdrup transport (Box 1) and ocean boundary pro- 
cesses (Fig. 1f). The resulting drainage of heat in turn weakens the 
thermocline feedback, and the phase of the ENSO recharge oscillator 
can transition into a La Nifa state (Fig. 1g, h), which is accompanied 
by recharging of heat through opposite wind-stress curl anomalies 
(Fig. 1h). 

Comparing the linear oscillator solution of equation (1) (constant 
Ty) with the scatter plot of observed equatorial eastern Pacific 
temperature and zonal mean thermocline depth anomalies (Fig. 2), 
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boxes list major processes involved in the phases of El Nifio evolution. 


we find substantial differences. The observed scatter diagram shows a 
high degree of irregularity and a notable positive skewness in eastern 
tropical Pacific SSTAs towards El Nifio events (Box 1, Fig. 2). El Nifio 
and La Nifia events are very different in terms of their amplitude and 
time evolution (Figs. 1k, 2a). To account for this additional level of 
complexity, the simple recharge oscillator model can be extended by 
including a nonlinear Bjerknes feedback term that represents either 
atmospheric or oceanic nonlinear processes” or multiplicative sto- 
chastic forcing’® (Box 1). For these extensions, the recharge model can 
then simulate the skewed probability distribution of ENSO (Fig. 2) and 
the fast growth from neutral to strong El Nifio conditions (Fig. 1j). The 
observed positive skewness of the SSTA (Fig. 2), which indicates the 
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i, The seasonal composite means (lines) and spread (shading) of eastern 
equatorial Pacific SSTA (red; averaged over the Nifio3 region: 5° S-5° N 
and 90° W-150° W) and equatorial Pacific zonal mean Z20 (blue) for 
the 17 El Nifio events. The diamond illustrates that ENSO predictability 
increases with increasing ENSO signal strength. j, The monthly standard 
deviation (s.d.) of the Nifio3 SSTA”® (red line) and an estimate of the 
monthly ENSO growth rate based on the Bjerknes stability index. The 
error bars show the 90% confidence range for the index calculated from 
the standard error of the regression slope*. k, Time series of the Nifio3 
SSTA and zonal mean equatorial Pacific depth anomaly from the 20 °C 


isotherm (2° S-2°N and 120° E-80° W) from the merged data product*””*’, 


importance of nonlinear dynamical and thermodynamical processes 
in the coupled tropical Pacific climate system, implies that strong 
El Nifio conditions, which typically last for one year, are on average 
shorter than La Nifa events, which can persist for up to several years 
(Fig. 1k). 

Whereas conceptual models like equation (1) can simulate some 
key features of ENSO evolution, they can neither explain the 
presence of its spatial diversity (Fig. 3a, b, f-m) nor the potential 
remote effects of variability originating from the extra-tropical Pacific, 
Atlantic or Indian Ocean onto this diversity. An improved framework 
to characterize and explain ENSO complexity is needed to capture 
these aspects. 
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Fig. 2 | Schematic representation of ENSO temporal complexity. Kernel 
density estimate of the joint probability distribution (orange shading) of 
the linearly detrended Nifio3 SSTA and zonal mean 20 °C isotherm depth 
anomalies (2° S-2° N and 120° E-80° W) for the period 1958-2016 from 
the merged data product?”*’. The grey circles indicate the monthly values 
of the two time series, smoothed with a three-month running mean filter. 
Dark and light-blue triangles indicate December values of EP (1972, 


Space-time complexity of ENSO 

Despite some prominent commonalities discussed in the previous 
section (Fig. 1), El Nifio events differ considerably from each other in 
terms of magnitude, spatial pattern, temporal evolution and predict- 
ability>-” (Figs. 2, 3f-m). To characterize the leading modes of equa- 
torial Pacific SST variability and their diverse timescales, we conduct 
an empirical orthogonal function (EOF) analysis of observed tropical 
Pacific SSTAs”® (Fig. 3), which identifies the leading orthogonal pat- 
terns of variability. The leading EOF (Fig. 3a), which corresponds to the 
classical El Nifo pattern with eastern tropical Pacific warming, exhibits 
variability on quasi-quadrennial timescales (3-7 years; spectral density 
estimate in Fig. 3a). By contrast, the second EOF, which explains only 
25% of the variance of the first mode, is characterized by an east-west 
zonal SST dipole in the tropical Pacific and has enhanced variance 
on quasi-biennial and decadal timescales (spectral density estimate in 
Fig. 3b). The interplay of these two EOFs largely captures the spatial 
diversity of the observed ENSO mode. 

Some El Nijfio events (for example, that of 1997/1998) are charac- 
terized by pronounced warming in the eastern Pacific and are referred 
to as EP El Nifio events (Fig. 3c, f), whereas others (for example, that 
of 2004/2005) show a stronger positive projection on the second EOF 
mode, which leads to a more pronounced central Pacific warming, and 
are known as CP El Nifio events (Fig. 3d, h-j). More generally, El Nifo 
events can be viewed as the superposition of the two EOF modes, which 
results in a continuum of ENSO characteristics”””* that capture a mix 
of EP and CP dynamics (for example, the 1991/1992 and 2015/2016 
events) (Fig. 2). La Nifia events (for example, that of 1999/2000; Fig. 3g), 
in addition to being weaker than El Nifo events, exhibit less diversity 
in their spatial patterns®”®, thus clearly pointing to an asymmetry in 
the underlying dynamical processes for ENSO. 

EP El Nifio events (for example, that of 1997/1998; Fig. 3c) tend to 
involve basin-scale equatorial wind anomalies, a strong relaxation of the 


538 | NATURE | VOL 559 | 26 JULY 2018 


1976, 1982, 1986, 1997, 2006 and 2015) and CP (1968, 1994 and 2009) El 
Nifios, respectively. Mixed events (1965, 1991 and 2002) are represented 
by combined dark and light-blue triangles. The years of the five largest 
El Nifio events are indicated. The white ellipse in the centre corresponds 
to the progression of the linear recharge oscillator, and arrows on the left 
(right) indicate charging (discharging) of subsurface warm water in the 
equatorial Pacific. 


zonal tilt of the equatorial thermocline (Fig. le), a more prominent role 
for the thermocline feedback (Box 1), large eastward shifts of tropical 
Pacific convection and strong discharge of heat content (Fig. 3c) away 
from the equatorial region, which boosts the likelihood of transitioning 
into a La Nifia event®?”, By contrast, CP El Nifio events (for example, 
that of 2004/2005; Fig. 3d) tend to involve more local wind feedbacks, 
a stronger role for the zonal advective feedback (Box 1), little reduction 
in the zonal tilt of the thermocline, weak shifts of convection, earlier 
termination, little poleward discharge of ocean heat content (Fig. 3d), 
a stronger role for thermal damping (Box 1) during the decay phase, a 
reduced likelihood to transition into La Nifia and more susceptibility 
to disruption by wind noise®*", Compared to CP El Nifos, strong EP 
El Nifios also tend to terminate later in boreal spring owing to trade 
wind collapse, which suppresses the upwelling that normally connects 
the SST to the evolving thermocline depth". 

The spatial diversity in the SSTA patterns of ENSO is also associated 
with different tropical precipitation patterns (Fig. 3f-m), resulting in 
potentially different remote teleconnection patterns and corresponding 
weather and climate impacts”*”. However, given the high level of inter- 
nal atmospheric variability** and the brevity of the historical record, 
it has remained difficult to unequivocally detect the differences in the 
impacts of various ENSO spatial modes. In addition to its spatial diver- 
sity, ENSO also exhibits substantial diversity in its temporal evolution 
(Figs. 1k, 3c-e). Understanding this diversity of El Nifo events is cru- 
cial for predicting the regional impacts of ENSO, for example, on pre- 
cipitation patterns, tropical cyclones and other types of severe weather’. 
The extent to which El Nifio diversity is predictable relates to whether 
ENSO complexity originates mainly from random processes or from 
low-frequency deterministic dynamics. Random processes affecting a 
single physical ENSO mode could generate diversity in amplitude, spa- 
tial structure and temporal evolution’, consistent with a spatial flavour 
continuum generated by different realizations of atmospheric noise’. 
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variance-preserving spectral power density of the normalized principal 
components shown on the left (the vertical axis is the period in years and 
the horizontal axis is the logarithm of the power). Numbers in parentheses 
indicate the variance of the EOF modes. c-e, Longitude-time evolution 

of the Pacific SSTA averaged over 5° S-5° N for selected observed ENSO 
events, the 28.5 °C isotherm of the SST (red curve) representing the edge 
of the Western Pacific Warm Pool, the longitude and strength of WWEs” 
(black circles) and (on the left) the associated equatorial Pacific heat- 
content anomaly (temperature anomaly averaged over the top 300 m of 


Alternatively, initial subsurface ocean conditions could modulate the 
role of stochastic wind forcing in producing diversity. For example, 
climate model simulations have demonstrated that in the presence of 
stochastic WWEs, an initial build-up of equatorial Pacific upper-ocean 
heat content can favour the development of EP, rather than CP, El Nifio 
events!04 (Fig. 3d, e). At the onset of strong El Nifio events**, such as 
those of 1997/1998 and 2015/2016 (Fig. 3c, e), WWE activity tends to 
strengthen and expand eastwards with the expansion of the Western 
Pacific Warm Pool and the relaxation of the trade winds. These WWE 
changes can be parameterized in equation (1) as multiplicative noise 
(Box 1), which can contribute to ENSO diversity and asymmetries™?®. 

Studies suggest that ENSO diversity may be triggered by climate 
phenomena outside the tropical Pacific, including the North*” and 
South*® Pacific meridional modes, extra-tropical atmospheric 
circulation patterns and tropical Atlantic variability>**". For example, 
the negative phase of the North Pacific Oscillation’! tends to favour the 
development of positive SSTAs in the central Pacific by weakening the 
trade winds in the Northern Hemisphere, while the positive phase of 
the South Pacific Oscillation tends to weaken the Southern Hemisphere 
trade winds, thereby favouring the development of positive SSTAs in 
the eastern Pacific. Such remote influences appear to be mediated 
primarily by how they project onto wind variations in the equatorial 
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the ocean and between 5° S-5° N and 120° E-90° W; range from —1 K to 

1 K; red, positive; blue, negative). f-m, Spatial pattern of SSTA (shaded) 
and precipitation anomaly’™ (contours; solid line, positive; dashed line, 
negative; 2mm d-! interval; zero contour omitted) averaged over the 
November-January season of selected ENSO events. We note that strong 
warm events (1997/98 and 2015/16) induce very strong eastward and 
equatorward shifts of rainfall. At the bottom right of a, b and f-m we show 
the associated principal components (PCs); namely, the projection of each 
SSTA spatial pattern onto the EOF patterns in a and b. The abscissa is PC1, 
the ordinate is PC2 and the arrow length is proportional to the magnitude 
in PC1-PC2 space (an arrow magnitude of 1 is indicated by the circles). 
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Pacific. In essence, westerly wind anomalies in the western equatorial 
Pacific tend to favour CP El Nifios, whereas westerly wind anomalies in 
the central-eastern equatorial Pacific tend to favour EP El Nifios. These 
external influences can precede the peak of El Nifio by 2-3 seasons**4! 
and may provide additional predictability to the spatial characteristics 
of an emerging El Nifio event. 

Since 1998, CP events have become more prevalent than EP events’. 
Such a decadal modulation in ENSO diversity is consistent with cou- 
pled general circulation models (CGCMs) that can spontaneously gen- 
erate multidecadal variations in ENSO diversity even in the absence of 
external radiative forcings*’. Low-frequency climatic drivers (includ- 
ing natural and anthropogenic forcings)—which involve basin-wide 
changes in the zonal SST gradient, thermocline depth and winds***°— 
may also have contributed to the observed decadal swings in ENSO 
diversity by favouring particular spatio-temporal modes“. At this stage, 
the observational record remains too short to quantify all the possible 
sources of the decadal modulation of ENSO characteristics. 

The current generation of climate models underestimates ENSO 
diversity*”. This issue is related to the models’ systematic biases, which 
affect the mean state and ENSO feedbacks. Sources of these biases 
include deficiencies in the simulation of clouds, atmospheric convec- 
tion and oceanic mixing**. In particular, atmospheric model responses 
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trends. Every time the probability for SSTAs (heat-content anomalies) to 
be in the range from —3 °C to 3 °C (—2 °C to 2 °C) is ~100%. However, 
some SSTA and heat-content values are more likely to occur than others. 
This probability is indicated by the coloured contours (probability 


tend to be relatively insensitive to distinct patterns of SSTAs owing to 
climatological dry and cold biases in the equatorial central Pacific’. 


Seasonal ENSO dynamics 

ENSO displays a close relationship with the seasonal cycle?!*°: El Nifio 
events usually start in boreal spring (Figs. Ic, i, 4), grow during the 
summer and autumn (Fig. 1d), reach their maximum intensity in win- 
ter (Fig. le, i) and decay rapidly during late winter and spring (Fig. 1f, j). 
In most cases, they transition to La Nifa events (Figs. 1g, h, 4) by the 
subsequent summer. This seasonal synchronization of ENSO translates 
into the observed eastern equatorial Pacific SSTA variance peaking dur- 
ing boreal winter and attaining minimum values during spring (Fig. 1)). 
It also leads to pronounced seasonal contrasts in the climate impacts 
and predictability of ENSO (Fig. 1i). ENSO influences the global atmos- 
pheric circulation, affecting, for instance, the Asian monsoons”! and 
the climate in America®’ and Australia®. 

Randomly occurring sequences of WWEs, typically during spring, 
can lead to an initial warming of the central eastern equatorial 
Pacific**°> (Fig. 1c). This initial SSTA can grow because the air-sea cou- 
pling is strongest in summer and early autumn*°*’ (Fig. 1j). Proposed 
physical processes for this summer-autumn coupling maximum 
include (i) the shift of the Intertropical Convergence Zone towards 
the Equator and its associated increase in western Pacific surface wind 
convergence®®, (ii) the seasonal outcropping of the equatorial thermo- 
cline, (iii) the seasonal cooling of the eastern equatorial Pacific*’ and 
(iv) the reduction of negative cloud feedbacks™. 

The decay of El Nifo events typically starts in boreal winter. The 
anomalous westerly wind anomalies shift southwards from the Equator, 
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leading to a shoaling of the eastern Pacific thermocline and a sub- 
sequent reduction of the overlying SSTA®! (Figs. 1f, 4a). This shift 
arises from climatological expansion of the Western Pacific Warm 
Pool into the Southern Hemisphere, coincident with the development 
of the South Pacific Convergence Zone™. In this season, the increased 
surface heat flux damping” results in a decrease of the air-sea coupling 
strength (Fig. 1j), which—together with the aforementioned seasonal 
southward wind shift® and the equatorial heat content discharge” 
(Fig. 4b)—leads to a rapid transition to a La Nifa state. 

While these seasonal processes generally operate for different fla- 
vours and phases of ENSO, differences in their relative importance can 
contribute to ENSO complexity. For instance, CP events typically termi- 
nate earlier and are less likely to transition to a La Nifia state compared 
to EP El Nifio events™ (Fig. 3d). Furthermore, La Nifia conditions can 
last up to 2-3 years (Figs. 3c, 4b). The ability to simulate ENSO seasonal 
synchronization for different types of El Nifio events varies strongly 
among the current generation of climate models, probably owing to 
biases in mean state and seasonal cycle*®. 

The influence of the seasonal variations of the air-sea coupling 
strength discussed above can be included in the framework of the 
recharge oscillator (equation (1)) by adding a seasonally varying growth 
rate (Ipy). As expected, this model then captures the observed ENSO 
seasonal synchronization characteristics, including the seasonal ENSO 
variance modulation and partial phase synchronization”. Interactions 
between the seasonal cycle in Jpy and the interannual ENSO temperature 
signal generate variance with periods of roughly 9 and 15-18 months, 
the so-called combination tone frequencies (Box 1) that broaden 
the ENSO spectrum predominantly towards higher frequencies. 
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These interacting dynamics create specific atmospheric circulation pat- 
terns that are together referred to as combination mode (C-mode) 
(Box 1). The spatial pattern of the C-mode exhibits a pronounced 
hemispheric asymmetry, which includes an anomalous cyclonic low- 
level wind circulation in the southern central tropical Pacific and an 
anomalous anticyclonic low-level wind circulation in the northwestern 
tropical Pacific. Some of the prominent local expressions are the afore- 
mentioned southward shift of equatorial wind anomalies® (Fig. le) and 
the anomalous western North Pacific anticyclone®™. 


ENSO predictability 

To link our understanding of dynamical tropical air-sea interactions 
with ENSO predictability, it is helpful to elucidate the seasonal evolution 
of (i) potential precursors that may contribute to long-term predicta- 
bility®* (9-15 months lead time), (ii) triggers that can rapidly increase 
the likelihood for event development (6-9 months lead time) and 
(iii) transition processes (see ‘A conceptual view of ENSO dynamics’ 
and ‘Seasonal ENSO dynamics’). The development of a typical EP event 
can be divided into different seasonal stages, which each contribute 
differently to the boreal winter 6-9 month SSTA forecasting skill in the 
Nifio 3.4 region (central to eastern Pacific), as illustrated by the anomaly 
correlation coefficient skill between seasonal forecasts performed with 
the North American Multimodel Ensemble®” (NMME) and the obser- 
vations (Fig. 4a, cyan dashed line). Prior to boreal spring, a charged 
western tropical Pacific heat content is a necessary condition for the 
subsequent development of El Nifio events (Fig. 4b). Thus, the corre- 
sponding warm pool heat advection processes have a key role in 
determining the long-term memory for ENSO forecasts. Furthermore, 
atmospheric precursors in the North*’ and South Pacific*’, the Indian 
Ocean” or the tropical Atlantic!””' have been suggested to influence 
the El Nifio evolution for long lead times. 

It must be emphasized here that the presence of such early oceanic 
or atmospheric precursors is usually not sufficient for El Nifio growth 
because one of the key trigger mechanisms is the stochastic WWE 
activity in boreal spring and early summer”. This is clearly illustrated 
by the fact that even though initial heat-content conditions were favour- 
able for El Niftio development in early 2012, 2014 and 2017, the sub- 
sequent SSTA growth stayed below expectations. Individual WWEs are 
not predictable beyond the weather prediction horizon, which implies 
that, on average, forecasts initialized in boreal spring have relatively 
low long-term skill”, in particular in the absence of precursor signals 
(Fig. 4a, cyan line). However, precursor signals in western tropical 
Pacific heat content (Fig. 4b) could be indicative of potentially devel- 
oping El Nifio conditions, which in effect enhance the predictability 
(Fig. 4a, orange line) relative to the averaged case (Fig. 4a, cyan line). 
The competing roles of stochasticity versus ocean memory for this 
so-called spring predictability barrier (Fig. 4a) and for long-lead-time 
forecasts have been intensely debated’*”>. 

If a sufficient amount of westerly momentum is transferred in 
boreal spring from the atmosphere to the ocean, zonal advective pro- 
cesses begin moving the warm pool front eastwards, and downwelling 
Kelvin waves (Box 1) generate surface warming in the eastern tropical 
Pacific about two months later. These anomalies are further intensi- 
fied (Fig. 4a) owing to increasing summer air-sea coupling strength 
(Fig. 1j), while anomalously warm water is drained from the Western 
Pacific Warm Pool (Fig. 4b). This phase exhibits a high degree of 
climate predictability, as documented by the high anomaly correlation 
coefficients (>0.6) between predicted boreal winter El Nifio events and 
observations for forecasts initialized in boreal summer (Fig. 4a). The 
subsequent demise of an El Nifio event is largely controlled by ocean 
subsurface processes and the discharge of zonal heat content away from 
the Equator (Figs. 1i, 2, equation (1)), as well as by the seasonally mod- 
ulated southward shift of westerly wind anomalies®, which in turn 
leads to a relaxation of the zonally integrated thermocline anomalies. 
This seasonally locked decay of El Nifio conditions under a low-noise 
atmospheric environment further contributes to the long-term aver- 
aged ENSO prediction skill®. 
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The subsequent evolution into a La Nifia state (Fig. 4a—-c) and the 
possibility of having multi-year La Nifa events (Figs. 1k, 4c) are less 
well understood. La Nifia events are often preceded by a strong El 
Nifio. However, as indicated by the broad probability distribution of 
the SSTA ata lag of 9-15 months (Fig. 4c), other initial conditions can 
also develop into La Nifa events peaking in boreal winter (Fig. 4c). 
During the transition from El Nifio to La Nifa, equatorial heat gets 
quickly discharged, and 6-9 months before a peak La Nifia in boreal 
winter, we observe the smallest values of the equatorial heat content 
(Fig. 1i) and a slow recharging tendency of the Western Pacific Warm 
Pool (Fig. 4d). However, during this period the probability density of 
western tropical Pacific heat-content anomalies is relatively broad, 
which translates into an overall reduction of predictive skill (Fig. 4c). 
Because longer-lasting La Nifia events are exposed to a variety of atmos- 
pheric and oceanic perturbations and the annual cycle, a dynamical 
decoupling of La Nifia and subsequent El Nifio events may occur”®. 
In boreal winter, during the peak of the La Nia, the Western Pacific 
Warm Pool is fully charged (Fig. 4d) to values that are typical for an El 
Nifio precursor (Fig. 4b). However, the SST conditions do not necessar- 
ily have to swing back to an El Nifo state and sometimes even a second 
La Nifia can develop. When comparing the anomaly correlation coef- 
ficient skill for December La Nifia target conditions with the averaged 
skill for all years (1980-2015) from the NUME”, we find very little 
difference (Fig. 4c) for lead times of 1-12 months, which suggests that 
(i) La Nifia conditions have a considerably lower predictability than El 
Niiio, (ii) the predictability of La Nifa is to a first order captured well 
by the mean statistical skill of the current generation of seasonal pre- 
diction models. Using ensemble forecasting techniques, a recent study”® 
identified potential predictors for the likelihood of multi-year La Niha 
events, which include the magnitude of thermocline discharge and the 
amplitude of the preceding El Nifio event, suggesting the possibility for 
longer-term forecasts also for La Nifia. 

How the different stages of predictability differ between CP and 
EP events and whether there are distinct precursor patterns for 
different ENSO flavours still remains controversial®”?. Despite an 
improved understanding of ENSO dynamics, the ENSO prediction 
skill has not demonstrated a steady improvement during the past few 
decades, with even a decrease at the turn of the 21st century”? . This 
decrease may be related to the reduced ENSO amplitude and the more 
frequent occurrence of CP events during that period”, as their evo- 
lution and climate impacts tend to be less predictable than those of 
EP El Nijito events®’. 


A unifying framework 

The previous discussion has highlighted a variety of dynamical path- 
ways that can be combined to explain the spatio-temporal complexity of 
the ENSO phenomenon (Fig. 5). Extending beyond the simple single- 
mode theory (equation (1); ‘A conceptual view of ENSO dynamics’), 
which captures several—but not all—features of ENSO dynamics, our 
proposed framework for ENSO complexity is based on the co-existence 
of a duplet of linear eigenmodes (Fig. 5a, b), which can be derived 
from a deterministic, intermediate-complexity tropical atmosphere 
and ocean model“ and a number of excitation mechanisms. These 
two generic coupled eigenmodes are characterized by spatial patterns 
that closely resemble the observed EP and CP modes (Fig. 5) and by 
timescales of approximately four and two years, respectively. The four- 
year (quasi-quadrennial) mode is more prominent (Fig. 5, lower left) 
when the mean thermocline is deep and the trade winds are weak, 
and it relies strongly on thermocline feedback. By contrast, the two- 
year (quasi-biennial) mode is dominant when the mean thermocline 
is shallow and the equatorial trade winds are strong. Its SST variability 
is strongly controlled by zonal advective feedback*®. These features are 
akin to their observational counterparts (Figs. 3, 5c, d). For realistic 
background states, both modes operate not far from criticality (zero 
growth rate) (Fig. 5, lower left), which implies that they can be easily 
excited by other processes. Their stability and excitability depends fur- 
ther on the prevailing climatic background conditions. 


26 JULY 2018 | VOL 559 | NATURE | 541 


© 2018 Springer Nature Limited. All rights reserved. 


REVIEW 


QQ mode 


Depth anomaly (m) 
Z20 anomaly (m) 


120° E 


QB mode 


20°S 


1997/1998 EP El Nifio 


180° 120° W 


2009 CP EI Nifio 


Indian Ocean, Atlantic Ocean and other external forcings 


ce -4ob 
£ E 

= — 

3 > 

5 E 

O. O(bosssseosese cece co ohoeceoee a 
2 2 

= 3 

s N 

- 40h 

! \ 60F ! | 
180° 120° W 120°E 180° 120° W 
1.0 f 


-1.0 -05 0.0 0.5 


Temperature (°C) 


QB mode growth rate 


QD 

8 | 

& 400 / a y/ 

6} y 

= / J pe g E 

< 90 y 1) = 

C — ms 5 

= VT ! | i 5 
130 140 150 130 140 S 

Upper layer depth (H; m) N 


2010 La Nifa 
é ACY 


<——__ eae 
-06 0 0.6 1.2 1.8 2.4 


Growth rate (yr) 


Fig. 5 | Mechanisms of ENSO complexity. a, b, Leading two eigenmodes 
of tropical Pacific SSTA and equatorial thermocline depth anomalies 
(averaged between 5° S—5° N) with periods of about 4 yr (QQ, quasi- 
quadrennial) and about 2 yr (QB, quasi-biennial), calculated fom an 
intermediate ENSO model**. The differences in zonal location of the 
centre in SSTAs and thermocline anomalies are largely due to the different 
roles of the zonal advective feedback (ZAF) and thermocline feedback 
(TF). ¢, Growth rates of the two eigenmodes as a function of the mean 
thermocline depth, H, and the mean strength of equatorial trade winds 
relative to climatological conditions. Black dots mark the mean state for 
the modes displayed in a and b. d-f, Patterns of SSTAs”® and equatorial 
Tropical Atmosphere Ocean (TAO)/ Triangle Trans-Ocean Buoy Network 


At the heart of our explanation for the spatial flavours of ENSO is the 
aforementioned multiplicity of coupled ENSO eigenmodes (Fig. 5a, b), 
as seen in an intermediate ENSO model*®. Furthermore, the temporal 
complexity is generated in part by the different oscillation frequencies 
of the quasi-quadrennial and quasi-biennial modes and additionally by 
different external excitation processes. Such processes are associated, 
for example, with the North and South Pacific meridional modes®! 82, 
the South Pacific booster*®, WWEs (see ‘Space-time complexity of 
ENSO’ and ‘Seasonal ENSO dynamics’), tropical instability waves or 
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er a0 Ww 
(TRITON) 20 °C thermocline depth anomalies for typical EP (1997/1998) 
and CP (2009/2010) El Nifio and La Nifia (boreal winter 2010) events 
(November-January), with schematic representations of the key excitation, 
nonlinear and cross-scale interaction mechanisms: annual cycle (ACY), 
WWEs, South Pacific booster (SPB), North and South Pacific meridional 
modes (NPMM and SPMM, respectively) and tropical instabilty waves 
(TIW). The solid red, eastward (blue, westward) arrows represent the ZAF 
and the red, upward (blue, downward) arrows denote the TF for El Nifio 
(La Nifia) conditions. The relative sizes and different zonal positions of 
the arrows indicate qualitatively the strength and areas of strong feedback 
efficiency. Curly upward (downward) arrows denote damping net surface 
heat flux (HF) feedback for El Nifio (La Nifia). 


transbasin influences” (Fig. 5). In particular, asymmetric dependencies 
related to the increased WWE activity during El Nifio and enhanced 
tropical instability wave activity during La Nifia make these cross-scale 
interactions very effective sources for ENSO complexity. Furthermore, 
the annual cycle of winds and SSTs has a key role in determining the 
seasonal timing of ENSO anomalies and its predictability (see ‘Seasonal 
ENSO dynamics’ and ‘ENSO predictability’). To further explain the 
fact that El Nifio anomalies are stronger in amplitude (see ‘A concep- 
tual view of ENSO dynamics’) and exhibit a more pronounced spatial 
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diversity (see ‘Space-time complexity of ENSO’) and higher predicta- 
bility* relative to their La Nifia counterparts (Figs. 4b, d, 5c, e), we need 
to invoke additional nonlinear processes. Nonlinearities, particularly in 
atmospheric deep convection and oceanic heat advection, can induce 
a wide range of additional timescales and new spatial structures***° 
by potentially coupling or amplifying the duplet of ENSO eigenmodes. 

Decadal subsurface processes*”** can affect the long-term climato- 
logical background state. In turn, this background state changes the 
stability of the two primary ENSO eigenmodes (Fig. 5a, b) and their 
excitability. Hence, slow background state changes in the Pacific Ocean 
can have a key role in generating and modulating the spatio-temporal 
complexity of ENSO. 

Our unifying framework for ENSO complexity (Fig. 5), which iden- 
tifies key factors for ENSO complexity (primary ENSO eigenmodes, 
excitation processes, nonlinearities and cross-timescale interactions), 
may serve as a roadmap for further hypothesis testing, process studies 
and diagnostic analysis of climate models. It can also help guide the 
evolution of the tropical Pacific observing system, which is essential 
for underpinning ENSO research and forecasting®’. In addition, this 
framework can be used to determine how the shortcomings in rep- 
resenting ENSO complexity in climate and Earth system models are 
related to a variety of feedback processes and biases in the mean state 
and annual cycle that affect the generation of climate variability. 


Outlook 

The reliability of dynamical seasonal climate predictions depends heavily 
on the representation of ENSO processes in CGCMs and on the con- 
tinuous improvement of the global ocean observing system. Climate 
models still exhibit stubborn climate biases in the eastern equatorial 
Pacific” that may affect their representation of feedbacks (see ‘A con- 
ceptual view of ENSO dynamics’) and ENSO complexity”, as well as 
the fidelity of operational ENSO forecasts. Identifying and resolving 
underlying systematic model biases will help in developing the next 
generation of models for seamless climate forecasts and projections. 

Future research on ENSO complexity needs to address the role of 
the seasonal cycle for CP ENSO dynamics, the near-absence of spatial 
diversity for La Nifia’’ (Fig. 5e), the impact of decadal background-state 
changes on ENSO modes vis-a-vis multiple-timescale processes involv- 
ing WWEs, tropical instability waves, extratropical triggers, as well as 
the response of the spatio-temporal complexity of ENSO to past and 
future climate change. Moreover, whether the underlying dynamical 
origin of spatio-temporal diversity in CGCMs can in fact be linked to 
the duplet of quasi-quadrennial and quasi-biennial ENSO eigenmodes 
must be investigated. This can be tested by applying interactive atmos- 
phere ensemble averaging techniques in coupled climate models”’, 
which artificially reduce non-SST-related atmospheric perturbations. 
Moreover, the use of flux-adjusted CGCMs” could help elucidate how 
model biases impact the spatial diversity of ENSO and provide a more 
effective way of improving seasonal climate predictions. Such experi- 
ments could reveal if there are distinct precursors for ENSO diversity, 
which could be used to further inform ENSO forecasts. Much scientific 
emphasis has been placed on understanding the growth of El Nifo 
events. However, given the severe impacts of La Nifa—for example, on 
drought in the southwestern United States? or the Horn of Africa”*— 
and the fact that La Nifia events may last longer than one year (Fig. 4), 
it will be paramount to gain also deeper understanding of the processes 
controlling La Nia and its predictability through observational, diag- 
nostic and modelling studies. 

A growing global population in the 21st century has become increas- 
ingly vulnerable to natural hazards as human activities alter the climate 
and the environment. Society therefore has an urgent demand for better 
climate products and services, including improved ENSO monitoring 
and predictions and long-term projections, to better inform decision 
making for agriculture and food security, public health, water resource 
management, energy production, human migration and disaster risk 
reduction. Because ENSO involves a broad range of disciplines, from 
atmosphere and ocean dynamics, to ecosystems and societal impacts, 


REVIEW 


it is a unifying concept in Earth system science”’. Thus, our proposed 
framework for ENSO complexity can serve as both a catalyst to further 
research and, in its practical applications, an essential contributor for 
sustainable development and environmental stewardship in a changing 
world. 
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Machine learning for molecular and 


materials science 


Keith T. Butler!, Daniel W. Davies’, Hugh Cartwright’, Olexandr Isayev** & Aron Walsh>°* 


Here we summarize recent progress in machine learning for the chemical sciences. We outline machine-learning 
techniques that are suitable for addressing research questions in this domain, as well as future directions for the field. 
We envisage a future in which the design, synthesis, characterization and application of molecules and materials is 


accelerated by artificial intelligence. 


he Schrédinger equation provides a powerful structure- 

property relationship for molecules and materials. For a given 

spatial arrangement of chemical elements, the distribution of 
electrons and a wide range of physical responses can be described. The 
development of quantum mechanics provided a rigorous theoretical 
foundation for the chemical bond. In 1929, Paul Dirac famously proclaimed 
that the underlying physical laws for the whole of chemistry are “completely 
known”!. John Pople, realizing the importance of rapidly developing 
computer technologies, created a program—Gaussian 70—that could 
perform ab initio calculations: predicting the behaviour, for molecules 
of modest size, purely from the fundamental laws of physics”. In the 1960s, 
the Quantum Chemistry Program Exchange brought quantum chemistry 
to the masses in the form of useful practical tools*. Suddenly, experi- 
mentalists with little or no theoretical training could perform quantum 
calculations too. Using modern algorithms and supercomputers, 
systems containing thousands of interacting ions and electrons can now 
be described using approximations to the physical laws that govern the 
world on the atomic scale**, 

The field of computational chemistry has become increasingly pre- 
dictive in the twenty-first century, with activity in applications as wide 
ranging as catalyst development for greenhouse gas conversion, materials 
discovery for energy harvesting and storage, and computer-assisted drug 
design’. The modern chemical-simulation toolkit allows the properties 
of a compound to be anticipated (with reasonable accuracy) before it has 
been made in the laboratory. High-throughput computational screening 
has become routine, giving scientists the ability to calculate the properties 
of thousands of compounds as part of a single study. In particular, den- 
sity functional theory (DFT)*”, now a mature technique for calculating 
the structure and behaviour of solids!®, has enabled the development of 
extensive databases that cover the calculated properties of known and 
hypothetical systems, including organic and inorganic crystals, single 
molecules and metal alloys!!"1¥, 

The emergence of contemporary artificial-intelligence methods has 
the potential to substantially alter and enhance the role of computers in 
science and engineering. The combination of big data and artificial intel- 
ligence has been referred to as both the “fourth paradigm of science” '* 
and the “fourth industrial revolution”!’, and the number of applications 
in the chemical domain is growing at an astounding rate. A subfield of 
artificial intelligence that has evolved rapidly in recent years is machine 
learning. At the heart of machine-learning applications lie statistical algo- 
rithms whose performance, much like that of a researcher, improves with 
training. There is a growing infrastructure of machine-learning tools for 


generating, testing and refining scientific models. Such techniques are 
suitable for addressing complex problems that involve massive combi- 
natorial spaces or nonlinear processes, which conventional procedures 
either cannot solve or can tackle only at great computational cost. 

As the machinery for artificial intelligence and machine learning 
matures, important advances are being made not only by those in main- 
stream artificial-intelligence research, but also by experts in other fields 
(domain experts) who adopt these approaches for their own purposes. As 
we detail in Box 1, the resources and tools that facilitate the application 
of machine-learning techniques mean that the barrier to entry is lower 
than ever. 

In the rest of this Review, we discuss progress in the application of 
machine learning to address challenges in molecular and materials 
research. We review the basics of machine-learning approaches, iden- 
tify areas in which existing methods have the potential to accelerate 
research and consider the developments that are required to enable more 
wide-ranging impacts. 


Nuts and bolts of machine learning 

With machine learning, given enough data and a rule-discovery algo- 
rithm, a computer has the ability to determine all known physical laws 
(and potentially those that are currently unknown) without human 
input. In traditional computational approaches, the computer is little 
more than a calculator, employing a hard-coded algorithm provided 
by a human expert. By contrast, machine-learning approaches learn 
the rules that underlie a dataset by assessing a portion of that data 
and building a model to make predictions. We consider the basic steps 
involved in the construction of a model, as illustrated in Fig. 1; this 
constitutes a blueprint of the generic workflow that is required for the 
successful application of machine learning in a materials-discovery 
process. 


Data collection 

Machine learning comprises models that learn from existing (train- 
ing) data. Data may require initial preprocessing, during which miss- 
ing or spurious elements are identified and handled. For example, the 
Inorganic Crystal Structure Database (ICSD) currently contains more 
than 190,000 entries, which have been checked for technical mistakes 
but are still subject to human and measurement errors. Identifying 
and removing such errors is essential to avoid machine-learning 
algorithms being misled. There is a growing public concern about 
the lack of reproducibility and error propagation of experimental data 
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Box | 
Learning to learn 


One of the most exciting aspects of machine-learning techniques is 
their potential to democratize molecular and materials modelling 
by reducing the computer power and prior knowledge required for 
entry. Just as Pople’s Gaussian software made quantum chemistry 
more accessible to a generation of experimental chemists, 
machine-learning approaches, if developed and implemented 
correctly, can broaden the routine application of computer 

models by non-specialists. The accessibility of machine-learning 
technology relies on three factors: open data, open software 

and open education. There is an increasing drive for open data 
within the physical sciences, with an ideal best practice outlined 
recently?®99, Some of the open software being developed is listed 
in Table 2. There are also many excellent open education resources 
available, such as massive open online courses (MOOCs). 

fast.ai (http://www.fast.ai) is a course that is “making neural nets 
uncool again” by making them accessible to a wider community of 
researchers. One of the advantages of this course is that users start 
to build working machine-learning models almost immediately. 
However, it is not for absolute beginners, requiring a working 
knowledge of computer programming and high-school-level 
mathematics. 

DataCamp (https://www.datacamp.com) offers an excellent 
introduction to coding for data-driven science and covers many 
practical analysis tools relevant to chemical datasets. This course 
features interactive environments for developing and testing code 
and is suitable for non-coders because it teaches Python at the 
same time as machine learning. 

Academic MOOCs are useful courses for those wishing to get 
more involved with the theory and principles of artificial intelligence 
and machine learning, as well as the practice. The Stanford MOOC 
(https://www.coursera.org/learn/machine-learning) is popular, 
with excellent alternatives available from sources such as https:// 
www.edx.org (see, for example, ‘Learning from data (introductory 
machine learning)’) and https://www.udemy.com (search for 
‘Machine learning A-Z’). The underlying mathematics is the topic of 
a course from Imperial College London (https://www.coursera.org/ 
specializations/mathematics-machine-learning). 

Many machine-learning professionals run informative blogs 
and podcasts that deal with specific aspects of machine-learning 
practice. These are useful resources for general interest as well as 
for broadening and deepening knowledge. There are too many 
to provide an exhaustive list here, but we recommend https:// 
machinelearningmastery.com and http://lineardigressions.com as 
a starting point. 


published in peer-reviewed scientific literature. In certain fields, such 
as cheminformatics, best practices and guidelines have been established 
to address these problems’. 

The training of a machine-learning model may be supervised, 
semi-supervised or unsupervised, depending on the type and amount 
of available data. In supervised learning, the training data consist of sets 
of input and associated output values. The goal of the algorithm is to 
derive a function that, given a specific set of input values, predicts the 
output values to an acceptable degree of fidelity. If the available dataset 
consists of only input values, unsupervised learning can be used in 
an attempt to identify trends, patterns or clustering in the data. Semi- 
supervised learning may be of value if there is a large amount of input 
data, but only a limited amount of corresponding output data. 

Supervised learning is the most mature and powerful of these 
approaches, and is used in the majority of machine-learning studies in 
the physical sciences, such as in the mapping of chemical composition 
to a property of interest. Unsupervised learning is less common, but 
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can be used for more general analysis and classification of data or to 
identify previously unrecognized patterns in large datasets’’. 


Data representation 

Even though raw scientific data are usually numerical, the form in 
which data are presented often affects learning. In many types of 
spectroscopy, the signal is acquired in the time domain, but for inter- 
pretation it is converted to the frequency domain using the Fourier 
transform. Like scientists, a machine-learning algorithm might learn 
more effectively using one format than the other. The process of con- 
verting raw data into something more suitable for an algorithm is called 
featurization or feature engineering. 

The more suitable the representation of the input data, the more 
accurately can an algorithm map it to the output data. Selecting how 
best to represent the data could require insight into both the underlying 
scientific problem and the operation of the learning algorithm, because 
it is not always obvious which choice of representation will give the best 
performance; this is an active topic of research for chemical systems"®. 

Many representations are available to encode structures and properties. 
One example is the Coulomb matrix!®, which contains information 
on atomic nuclear repulsion and the potential energy of free atoms, 
and is invariant to molecular translations and rotation. Molecular 
systems also lend themselves to descriptions as graphs”. In the solid 
state, the conventional description of crystal structures that uses trans- 
lation vectors and fractional coordinates of the atoms is not appro- 
priate for machine learning because a lattice can be represented in an 
infinite number of ways by choosing a different coordinate system. 
Representations based on radial distribution functions?!, Voronoi 
tessellations” or property-labelled materials fragments”* are among 
the new ways in which this problem is being tackled. 


Choice of learner 

When the dataset has been collected and represented appropriately, it 
is time to choose a model to learn from it. A wide range of model types 
(or learners) exists for model building and prediction. Supervised- 
learning models may predict output values within a discrete set (such 
as categorizing a material as a metal or an insulator) or a continuous 
set (such as polarizability). Building a model for the former requires 
classification, whereas the latter requires regression. A range of learn- 
ing algorithms can be applied (see Table 1), depending on the type of 
data and the question posed. It may be helpful to use an ensemble of 
different algorithms, or of similar algorithms with different values for 
their internal parameters (known as ‘bagging’ or ‘stacking’), to create a 
more robust overall model. We outline some of the common algorithms 
(learners) in the following. 

Naive Bayes classifiers” are a collection of classification algorithms 
based on Bayes’ theorem that identify the most probable hypothesis, 
given the data as prior knowledge about the problem. Bayes’ theorem 
provides a formal way of calculating the probability that a hypothesis 
is correct, given a set of existing data. New hypotheses can then be 
tested and the prior knowledge updated. In this way, the hypothesis 
(or model) with the highest probability of correctly representing the 
data can be selected. 

In k-nearest-neighbour”’ methods, the distances between samples 
and training data in a descriptor hyperspace are calculated. They are 
so called because the output value for a prediction relies on the values 
of the k ‘nearest neighbours in the data, where k is an integer. Nearest- 
neighbour models can be used in both classification and regression 
models: in classification, the prediction is determined by the class of 
the majority of the k nearest points; in regression, it is determined by 
the average of the k nearest points. 

Decision trees”® are flowchart-like diagrams used to determine a 
course of action or outcome. Each branch of the tree represents a pos- 
sible decision, occurrence or reaction. The tree is structured to show 
how and why one choice may lead to the next, with branches indicating 
that each option is mutually exclusive. Decision trees comprise a root 
node, leaf nodes and branches. The root node is the starting point of 
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Fig. 1 | Evolution of the research workflow in computational chemistry. 
The standard paradigm in the first-generation approach is to calculate 

the physical properties of an input structure, which is often performed 

via an approximation to the Schrodinger equation combined with local 
optimization of the atomic forces. In the second-generation approach, 

by using global optimization (for example, an evolutionary algorithm) 

an input of chemical composition is mapped to an output that contains 
predictions of the structure or ensemble of structures that the combination 
of elements are likely to adopt. The emerging third-generation approach 
is to use machine-learning techniques with the ability to predict 
composition, structure and properties provided that sufficient data are 
available and an appropriate model is trained. Four stages of training a 
machine-learning model with some of the common choices are listed in 
the bottom panel. 


the tree. Both root and leaf nodes contain questions or criteria to be 
addressed. Branches are arrows connecting nodes, showing the flow 
from question to answer. Decision trees are often used in ensemble 
methods (meta-algorithms), which combine multiple trees into one 
predictive model to improve performance. 

Kernel methods are a class of algorithms, the best known members 
of which are support vector machine and kernel ridge regression”’. The 
name ‘kernel’ comes from the use of a kernel function—a function that 
transforms input data into a higher-dimensional representation that 
makes the problem easier to solve. In a sense, a kernel is a similarity 
function provided by the domain expert: it takes two inputs and creates 
an output that quantifies how similar they are. 

Artificial neural networks and deep neural networks”* loosely mimic 
the operation of the brain, with artificial neurons (the processing unit) 
arranged in input, output and hidden layers. In the hidden layers, each 
neuron receives input signals from other neurons, integrates those 
signals and then uses the result in a straightforward computation. 
Connections between neurons have weights, the values of which rep- 
resent the stored knowledge of the network. Learning is the process 
of adjusting the weights so that the training data are reproduced as 
accurately as possible. 

Whatever the model, most learners are not fully autonomous, requiring 
at least some guidance. The values of internal variables (hyperparameters) 
are estimated beforehand using systematic and random searches, or 
heuristics. Even modest changes in the values of hyperparameters can 
improve or impair learning considerably, and the selection of optimal 
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values is often problematic. Consequently, the development of auto- 
matic optimization algorithms is an area of active investigation, as is 
their incorporation into accessible packages for non-expert users (see 
Table 2). 


Model optimization 

When the learner (or set of learners) has been chosen and predictions 
are being made, a trial model must be evaluated to allow for optimiza- 
tion and ultimate selection of the best model. Three principal sources of 
error arise and must be taken into account: model bias, model variance 
and irreducible errors, with the total error being the sum of these. Bias 
is the error from incorrect assumptions in the algorithm and can result 
in the model missing underlying relationships. Variance is sensitivity 
to small fluctuations in the training set. Even well-trained machine- 
learning models may contain errors due to noise in the training data, 
measurement limitations, calculation uncertainties, or simply outliers 
or missing data. Poor model performance usually indicates a high bias 
or a high variance, as illustrated in Fig. 2. 

High bias (also known as underfitting) occurs when the model is not 
flexible enough to adequately describe the relationship between inputs 
and predicted outputs, or when the data are insufficiently detailed to 
allow the discovery of suitable rules. High variance (or overfitting) 
occurs when a model becomes too complex; typically, this occurs as 
the number of parameters is increased. The diagnostic test for over- 
fitting is that the accuracy of a model in representing training data 
continues to improve, while the performance in estimating test data 
plateaus or declines. 

The key test for the accuracy of a machine-learning model is its 
successful application to unseen data. A widely used method for 
determining the quality of a model involves withholding a randomly 
selected portion of data during training. This withheld dataset, known 
as a test set, is shown to the model once training is complete (Fig. 2). 
The extent to which the output data in the validation set is accurately 
predicted then provides a measure of the effectiveness of training. 
Cross-validation is reliable only when the samples used for training 
and validation are representative of the whole population, which may 
present problems if the sample size is small or if the model is applied 
to data from compounds that are very different to those in the original 
dataset. A careful selection of methods for evaluating the transferability 
and applicability of a model is required in such cases. 


Accelerating the scientific method 

Whether through the enumeration and analysis of experimental data 
or the codification of chemical intuition, the application of informatics 
to guide laboratory chemists is advancing rapidly. In this section, we 
explore how machine learning is helping to progress and to reduce the 
barriers between chemical and materials design, synthesis, character- 
ization and modelling. We also describe some of the important devel- 
opments in the field of artificial intelligence for data-mining existing 
literature. 


Guiding chemical synthesis 

Organic chemists were among the first scientists to recognize the 
potential of computational methods in laboratory practice. E. J. Corey’s 
Organic Chemical Simulation of Synthesis (OCSS) program”’, devel- 
oped 50 years ago, was an attempt to automate retrosynthetic analysis. 
Ina synthetic chemistry route, the number of possible transformations 


Table 1 | Classes of machine-learning techniques and some chemical questions they could answer 


Class Bayesian Evolutionary# 


Symbolist 


Connectionist Analogist 


Method 


Algorithms include 


Probabilistic inference Evolving structures 


Naive Bayes 
Bayesian networks 


Genetic algorithm 
Particle swarm 


Chemical query Is my new theory valid? What molecule gives 


this property? 


Logical inference 


Rules 
Decision trees 


How do | make this 
material? 


Pattern recognition Constrained optimization 


Artificial neural networks 
Back propagation 


What compound did | 
synthesize? 


Nearest neighbour 
Support vectors 


Find a structure—-property 
relation 


The classes shown were chosen following ref. 9”. 


@Although evolutionary algorithms are often integrated into machine-learning procedures, they form part of a wider class of stochastic search algorithms. 
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Table 2 | Publicly accessible learning resources and tools related to machine learning 


Name Description URL 
General-purpose machine-learning frameworks 
Caret Package for machine learning in R https://topepo.github.io/caret 


Deeplearning4j Distributed deep learning for Java 


+ 


https://deeplearning4j.org 


H20.ai Machine-learning platform written in Java that can be imported as a Python or R library https://h2o.ai 
Keras High-level neural-network API written in Python https://keras.io 
Mlpack Scalable machine-learning library written in C++ https://mlpack.org 
Scikit-learn Machine-learning and data-mining member of the scikit family of toolboxes built around the —_http://scikit-learn.org 
SciPy Python library 
Weka Collection of machine-learning algorithms and tasks written in Java https://cs.waikato.ac.nz/ml/weka 


Machine-learning tools for molecules and materials 


Amp Package to facilitate machine learning for atomistic calculations https://bitbucket.org/andrewpeterson/amp 
ANI eural-network potentials for organic molecules with Python interface https://github.com/isayev/ASE_ANI 

COMBO Python library with emphasis on scalability and efficiency https://github.com/tsudalab/combo 
DeepChem Python library for deep learning of chemical systems https://deepchem.io 

GAP Gaussian approximation potentials http://libatoms.org/Home/Software 
MatMiner Python library for assisting machine learning in materials science https://hackingmaterials.github.io/matminer 
NOMAD Collection of tools to explore correlations in materials datasets https://analytics-toolkitnomad-coe.eu 
PROPhet Code to integrate machine-learning techniques with quantum-chemistry approaches https://github.com/biklooost/PROPhet 
TensorMol eural-network chemistry package https://github.com/jparkhill/TensorMol 


per step can range from around 80 to several thousand”; for compar- 
ison, there are only tens of potential moves at each game position in 
chess*". In chemical synthesis, human experts are required to specify 
conditional and contextual rules, which exclude large sets of potential 


Training score 
Test score 


Error 


Underfitting Overfitting 


Model complexity 


Fig. 2 | Errors that arise in machine-learning approaches. Errors 

can arise during both the training of a new model (blue line) and the 
application of a built model (red line). A simple model may suffer from 
high bias (underfitting), whereas a complex model may suffer from high 
variance (overfitting), which leads to a bias—variance trade-off. In the 
underfitting region the model performance can improve with further 
parameterization, whereas in the overfitting region the model performance 
will decrease. The optimal point for a model is just before the performance 
on the testing set starts to deteriorate with increased parameterization, 
which is indicated by the dashed vertical line. The model shown here is 
built on an example from https://kaggle.com, available at https://keeeto. 
github.io/blog/bias_variance. The shaded areas show the standard 
deviations of the fits for model training (blue) and testing (red). 
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reagents at a given step, limiting the number of choices available to the 
algorithm. The contextual rules (typically many thousands of them) are 
of utmost importance if a machine relying on a traditional algorithm 
is to compete with an expert. Recent breakthroughs in the Chematica 
program have shown that computers can be more efficient than humans 
in these tasks*”. 

The combination of extremely complex systems and huge num- 
bers of potential solutions, which arise from competing objective 
functions (such as cost, purity, time and toxicity), make synthetic chem- 
istry ill-suited to the application of traditional algorithmic approaches. 
However, because of this complexity, synthesis is one area of research 
that can benefit most from the application of artificial intelligence. 

Deep-learning approaches, which typically rely on many-layered 
artificial neural networks or a combination of artificial neural networks 
and other learning techniques such as Boltzmann machines, are 
showing particular promise for predicting chemical-synthesis routes 
by combining rules-based expert systems with neural networks that 
rank the candidate synthetic pathways*? or the likelihood of a pre- 
dicted product by applying the rules**. One artificial neural network 
that learned from examples taken from the chemical literature was able 
to achieve a level of sophistication such that trained chemists could not 
distinguish between computer- and human-expert-designed routes*”. 
However, a severe drawback of rules-based systems is that they have 
difficulty operating outside their knowledge base. 

Alternatives to rules-based synthesis prediction have also been 
proposed, for example, so-called sequence-to-sequence approaches, 
which are based on the relationships between organic chemistry and 
linguistics. By casting molecules as text strings, these relationships 
have been applied in several chemical-design studies****. In sequence- 
to-sequence approaches, a model is fed an input of products and then 
outputs reactants as a SMILES string®’”. A similar approach has also 
been applied to retrosynthesis**. Future developments in areas such as 
one-shot learning (as recently applied to drug discovery)? could lead to 
wider application of non-rules-based methods in fields such as natural 
product synthesis, for which training data are scarce. 

Beyond the synthesis of a target molecule, machine-learning models 
can be applied to assess the likelihood that a product will crystallize. 
By applying feature-selection techniques, a two-parameter model 
capable of predicting the propensity of a given molecule to crystallize 
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with an accuracy of around 80% has been demonstrated“. Crucially, 
this model had access to a training set of more than 20,000 crystalline 
and non-crystalline compounds. The availability of such open-access 
databases is pivotal for the further development of similar predictive 
models*!. In another study, a model was trained to predict the reaction 
conditions for new organically templated inorganic-product formation 
with a success rate of 89%*”. 

A less explored avenue of machine learning is how to best sample the 
set of possible experimental set-ups. Active learning predicts the optimal 
future experiments that are required to better understand a given problem. 
It was recently applied to help to understand the conditions for the 
synthesis and crystallization of complex polyoxometalate clusters”. 
Starting from initial data on failed and successful experiments, the 
machine-learning approach directed future experiments and was 
shown to be capable of covering six times as much crystallization space 
as a human researcher in the same number of experiments. 

Computational assistance for the planning and direction of chemical 
synthesis has come a long way since the early days of hand-coded expert 
systems. Much of this progress has been achieved in the past five years. 
Incorporation of artificial-intelligence-based chemical planners, with 
advances in robotic synthesis’, promises a rich new frontier in the 
production of novel compounds. 


Assisting multi-dimensional characterization 

The structure of molecules and materials is typically deduced by a com- 
bination of experimental methods, such as X-ray and neutron diffrac- 
tion, magnetic and spin resonance, and vibrational spectroscopy. Each 
approach has a certain sensitivity and length scale, and information 
from each method is complementary. Unfortunately, it is rare that data 
are fully assimilated into a coherent description of atomic structure. 
Analyses of individual streams often result in conflicting descriptions 
of the same compound™. A solution could be to incorporate real-time 
data into the modelling, with results then returned to the experiment, 
forming a feedback loop*’. Machine learning represents a unifying 
framework that could enable the synergy of synthesis, imaging, theory 
and simulations. 

The power of machine-learning methods for enhancing the link 
between modelling and experiment has been demonstrated in the 
field of surface science. Complex surface reconstructions have been 
characterized by combining ab initio simulations with multi-stage 
pattern-recognition systems that use convolutional neural networks**. 
Machine-learning methods have also recently shown promise in areas 
such as microstructural characterization*” and the identification of 
interesting regions in large, complex, neutron-scattering, volumetric 
(three-dimensional) datasets*®. Another example of machine learn- 
ing opening new avenues in an area of complicated characterization is 
phase transitions of highly correlated systems; neural networks have 
been trained to encode topological phases of matter and thus identify 


transitions between them”. 


Enhancing theoretical chemistry 
Modelling is now commonly considered as being equally impor- 
tant as synthesis and characterization for successful programmes of 
research. Using atomistic simulations, the properties of a molecule or 
material can, in principle, be calculated for any chemical composition 
and atomic structure. In practice, the computations grow rapidly in 
complexity as the size of the system increases, so considerable effort 
is devoted to finding short-cuts and approximations that enable the 
properties of the material to be calculated to an acceptable degree of 
fidelity, without the need for unreasonable amounts of computer time. 
Approaches based on DFT have been successful in predicting the 
properties of many classes of compounds, offering generally high 
accuracy at reasonable cost. However, DFT and related electronic- 
structure techniques are limited by the exchange-correlation functional 
that describes non-classical interactions between electrons. There are 
notable limitations of the current approximations for weak chemical 
interactions (such as in layered materials), for highly correlated (d and 
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f electron) systems and for the latest generation of quantum materials 
(such as iron pnictide superconductors), which often require a more 
sophisticated many-body Hamiltonian. Drawing from the growing 
number of structure-property databases (Table 3), accurate univer- 
sal density functionals can be learned from data*”?'. Early examples 
include the Bayesian error-estimation functional” and combinatorially 
optimized DFT functionals**. Going beyond the standard approach to 
DFT, the need to solve the Kohn-Sham equations can be by-passed 
by learning density-to-energy and density-to-potential maps directly 
from training systems”*. 

Equally challenging is the description of chemical processes across 
length scales and timescales, such as the corrosion of metals in the pres- 
ence of oxygen and water. A realistic description of chemical interactions 
(bond forming and breaking) including solvents, interfaces and disorder 
is still limited by the computational cost of available quantum-mechanical 
approaches. The task of developing transferrable analytic force fields is 
a well-defined problem for machine learning”*”*. It has been demon- 
strated that, in simple materials, approximate potential-energy surfaces 
learned from quantum-mechanical data can save orders of magnitude 
in processing cost*”**, Although the combination of methods with var- 
ying levels of approximation is promising, much work is needed in the 
quantification and minimization of error propagation across methods. 
In this context, initiatives for error estimation such as the DAKOTA 
package (https://dakota.sandia.gov) are critically important. 


Targeting discovery of new compounds 

We have considered how machine learning can be used to enhance and 
integrate synthesis, characterization and modelling. However, machine 
learning can also reveal new ways of discovering compounds. Models 
that relate system descriptors to desirable properties are already used 
to reveal previously unknown structure-property relationships*”. So 
far, the fields of molecular (primarily pharmaceutical and medicinal) 
and materials chemistry have experienced different degrees of uptake 
of machine-learning approaches to the design of new compounds, in 
part owing to the challenges of representing the crystal structure and 
morphology of extended solids. 


Crystalline solids. The application of machine learning to the discovery 
of functional materials is an emerging field. An early report in 1998 
applied machine learning to the prediction of magnetic and optoelec- 
tronic materials, but the number of studies has risen substantially 
only since 2010". The complexity of games like Go is reminiscent 
of certain problems in materials science, such as the description of 
on-lattice interactions that govern chemical disorder, magnetism and 
ferroelectricity. Even for representations of small unit cells, the number 
of configurations of a disordered crystal can quickly exceed the limita- 
tions of conventional approaches. An inverse-design procedure illus- 
trated how such a combinatorial space for an alloy could be harnessed 
to realize specific electronic structure features®”. Similar inverse-design 
approaches have also been applied in molecular chemistry to tailor 
ground- and excited-state properties. 

Predicting the likelihood of a composition to adopt a given crystal 
structure is a good example of a supervised classification problem in 
machine learning. Two recent examples involve predicting how likely 
a given composition is to adopt the so-called Heusler and half-Heusler 
crystal structures. The first predicts the likelihood that a given compo- 
sition will adopt the Heusler structure and is trained on experimental 
data”. This approach was applied to screen hypothetical compositions 
and successfully identified 12 new gallide compounds, which were sub- 
sequently verified experimentally. In the second, a random forest model 
was trained on experimental data to learn the probability that a given 
ABC stoichiometry would adopt the half-Heusler structure”. 

As an alternative to learning from experimental data, calculated 
properties can be used as a training set for machine learning. Assessing 
the degree of similarity between electronic band structures has been 
shown to yield improved photocathodes for dye-sensitized solar cells”’. 
A machine-learning model, trained to reproduce energies for the 
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Table 3 | Publicly accessible structure and property databases for molecules and solids 


Name 


Description 


URL 


Computed structures and properties 


AFLOWLIB 


Computational Materials Repository 


GDB 
Harvard Clean Energy Project 


Materials Project 


NOMAD 


Open Quantum Materials Database 


REL Materials Database 
EDesignLab 


4 


NC 


Z 

E 
ChEMBL 
ChemSpider 

Citrination 

Crystallography Open Database 
CSD 

CSD 


atNavi 


atWeb 


xperimental structures and properties 


Structure and property repository from high-throughput ab initio calculations 
of inorganic materials 


Infrastructure to enable collection, storage, retrieval and analysis of data from 
electronic-structure codes 


Databases of hypothetical small organic molecules 
Computed properties of candidate organic solar absorber materials 


Computed properties of known and hypothetical materials carried out using a 
standard calculation scheme 


Input and output files from calculations using a wide variety of electronic- 
structure codes 


Computed properties of mostly hypothetical structures carried out using a 
standard calculation scheme 


Computed properties of materials for renewable-energy applications 


Experimental and computed properties to aid the design of new thermo- 
electric materials 


Commercially available organic molecules in 2D and 3D formats 


Bioactive molecules with drug-like properties 


Royal Society of Chemistry’s structure database, featuring calculated and 
experimental properties from a range of sources 


Computed and experimental properties of materials 

Structures of organic, inorganic, metal-organic compounds and minerals 
Repository for small-molecule organic and metal-organic crystal structures 
norganic Crystal Structure Database 


ultiple databases targeting properties such as superconductivity and 
thermal conductance 


Datasheets for various engineering materials, including thermoplastics, semi- 


http://aflowlib.org 


https://cmr.fysik.dtu.dk 


http://gdb.unibe.ch/downloads 


https://materialsproject.org 


https://nomad-repository.eu 


http://oqmd.org 


https://materials.nrel.gov 


http://tedesignlab.org 


https://zinc15.docking.org 


https://www.ebi.ac.uk/chembl 


https://chemspider.com 


https://citrination.com 
http://crystallography.net 
https://www.ccde.cam.ac.uk 
https://icsd.fiz-karlsruhe.de 
http://mits.nims.go.jp 


http://matweb.com 


https://cepdb.molecularspace.org 


conductors and fibres 
IST Chemistry WebBook 

IST Materials Data Repository 
PubChem 


High-accuracy gas-phase thermochemistry and spectroscopic data 
Repository to upload materials data associated with specific publications 


Biological activities of small molecules 


https://webbook.nist.gov/chemistry 


https://materialsdata.nist.gov 


https://pubchem.ncbi.nIm.nih.gov 


elpasolite crystal structure (ABC D¢), was applied to screen all two 
million possible combinations of elements that satisfy the formula, 
revealing chemical trends and identifying 128 new materials”. Such 
models are expected to become a central feature in the next generation 
of high-throughput virtual screening procedures. 

The majority of crystal-solid machine-learning studies so far have 
concentrated on a particular type of crystal structure. This is because 
of the difficulty of representing crystalline solids in a format that can 
be fed easily to a statistical learning procedure. By concentrating on 
a single structure type, the representation is inherently built into the 
model. Developing flexible, transferrable representations is one of the 
important areas of research in machine learning for crystalline solids 
(see subsection ‘Data representation). As we will see below, the use of 
machine learning in molecular chemistry is more advanced than in 
the solid state, to a large extent owing to the greater ease with which 
molecules can be described in a manner amenable to algorithmic 
interpretation. 


Molecular science. The quantitative structure-activity relationship is 
now a firmly established tool for drug discovery and molecular design. 
With the development of massive databases of assayed and virtual 
molecules’*”4, methods for rapid, reliable, virtual screening of these 
molecules for pharmacological (or other) activity are required to 
unlock the potential of such molecules. Models based on quantitative 
structure-activity relationships can be described as the application of 
statistical methods to the problem of finding empirical relationships of 
the type P,=k’(D, D, ..., D,,), where P; is the property of interest, k’ is a 
(typically linear) mathematical transformation and D;are calculated or 
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measured structural properties”°. Machine learning has a long history 
in the development of quantitative structure—activity relationships, 
stretching back over halfa century”®. 

Molecular science is benefitting from cutting-edge algorithmic devel- 
opments in machine learning such as generative adversarial networks”” 
and reinforcement learning for the computational design of biologically 
active compounds. In a generative adversarial network, two models 
are trained simultaneously: a generative model (or generator) captures 
the distribution of data while a discriminative model (or discrimina- 
tor) estimates the probability that a sample came from the training set 
rather than the generator. The training procedure for the generator 
is to maximize the probability of the discriminator making an error 
(Fig. 3). Models based on objective-reinforced generative adversar- 
ial networks” are capable of generating new organic molecules from 
scratch. Such models can be trained to produce diverse molecules that 
contain specific chemical features and physical responses, through a 
reward mechanism that resembles classical conditioning in psychology. 
Using reinforcement learning, newly generated chemical structures 
can be biased towards those with the desired physical and biological 
properties (de novo design). 


Reclaiming the literature 

A final area for which we consider the recent progress of machine learning 
(across all disciplines) is tapping into the vast amount of knowledge that 
already exists. Although the scientific literature provides a wealth of 
information to researchers, it is increasingly difficult to navigate owing 
to the proliferation of journals, articles and databases. Text mining 
has become a popular approach to identifying and extracting 
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——————> Sample 


Real-world molecules 


Latent random 
variable 


Generator —> Sample 


Fig. 3 | The generative adversarial network (GAN) approach to 
molecular discovery. Two models—a generator and a discriminator— 
play a continuous ‘game’ The generator learns to map from a latent 
random variable (noise) to a particular distribution of molecules. The 
discriminator learns to get better and better at distinguishing fake data 


information from unstructured text sources. This approach can be used 
to extract facts and relationships in a structured form to create special- 
ized databases, to transfer knowledge between domains and, more gen- 
erally, to support research decision-making”. Text mining is applied to 
answer many different research questions, including in the discovery 
of drug-protein target associations, the analysis of high-throughput 
experiments and the development of systematic materials databases®”. 
Owing to the heterogeneous nature of written resources, the automated 
extraction of relevant information is far from trivial. To address this, 
text mining has evolved into a sophisticated and specialized field where 
text-processing and machine-learning techniques are combined. 

In cases where supplementary data are provided with a publication, 
it is made available in various formats and databases, often without 
validated or standardized metadata. The issue of data and metadata 
interoperability is key. Some examples of forward-looking initiatives 
that are pushing accessible, reusable data in scientific research are The 
Molecular Sciences Software Institute (http://molssi.org) and the Open 
Science Monitor (https://ec.europa.eu/research/openscience). 


Frontiers in machine learning 

There are many opportunities for further breakthroughs in machine 
learning to provide even greater advances in the automated design and 
discovery of molecules and materials. Here we highlight some frontiers 
in the field. 


More knowledge from smaller datasets 

Machine-learning approaches typically require large amounts of data 
for learning to be effective. Although this is rarely an issue in fields such 
as image recognition, in which millions of input datasets are available, 
in chemistry or materials science we are often limited to hundreds or 
thousands, if not fewer, high-quality data points. Researchers need to 
become better at making the data associated with our publications 
accessible in a computer-readable form. Another promising solution 
to the problem of limited datasets is meta-learning, whereby knowledge 
is learned within and across problems®!. New developments such as 
neural Turing machines® and imitation learning® are enabling the 
realization of this process. A Bayesian framework has recently been 
reported to achieve human-level performance on one-shot learning 
problems with limited data**, which has consequences for molecular 
and materials science where data are sparse and generally expensive 
and slow to obtain. 


Efficient chemical representations 

The standard description of chemical reactions, in terms of composi- 
tion, structure and properties, has been optimized for human learning. 
Most machine-learning approaches for chemical reactions or properties 
use molecular or atomic descriptors to build models, the success of 


REVIEW 


Real 


Discriminator > > Loss 


Fake 


from real data. The two artificial neural networks are optimizing a 
different and opposing objective function, or loss function, in a zero-sum 
game. The general mathematical formulation for the GAN approach 

to unsupervised machine learning is outlined in ref. 7”. 


which is determined by the validity and relevance of these descriptors. A 
good descriptor must be simpler to obtain than the target property and 
of as low dimensionality as possible®. In the context of materials, useful 
descriptors® and approaches for adapting simple existing heuristics 
for machine learning have been outlined®’; however, much work 
remains to develop powerful new descriptions. In the field of molecular 
reactions, advances such as the use of neural networks to create 
fingerprints for molecules in reactions are leading to improvements 
in synthesis prediction®®. As has been demonstrated by the success- 
ful adoption of the concept of molecular fragments”, the field of 
crystalline-materials design can learn much from advances in molecular 
nomenclature and representation. 


Quantum learning 

Whereas classical computing processes bits that are either 1 or 0, quan- 
tum computers use the quantum superposition of states to process 
qubits that are both 1 and 0 at the same time®. This parallelization leads 
to an exponential speedup in computational efficiency as the number of 
(qu)bits used increases’. Computational chemistry is a strong candi- 
date to benefit from quantum computing because solving Schrédinger’s 
equation on a quantum computer that consists of interacting electrons 
has a natural fit?’. One of the challenges for quantum computing is 
knowing how to detect and correct errors that may occur in the data. 
Despite substantial efforts in industry and academia, no error-corrected 
qubits have been built so far. Quantum machine learning explores the 
application of machine-learning approaches to quantum problems, and 
vice versa—the application of quantum computing to machine-learning 
problems. The possibility of exponential speedups in optimization 
problems means that quantum machine learning has enormous poten- 
tial. In problems such as optimizing synthetic routes” or improving a 
given metric (for example, optical absorption for solar energy mate- 
rials) where multiple acceptable solutions exist, loss of qubit fidelity is 
less serious than when certainty is required. The physical sciences could 
prove a particularly rich field for quantum-learning applications”***. 


Establishing new principles 

Automatic discovery of scientific laws and principles®>* by inspec- 
tion of the weights of trained machine-learning systems is a potentially 
transformational development in science. Although models developed 
from machine learning are predictive, they are not necessarily (or even 
usually) interpretable; there are several reasons for this. First, the way 
in which a machine-learning model represents knowledge rarely maps 
directly to forms that scientists are familiar with. Given suitable data, an 
artificial neural network might discover the ideal gas law (pV=nRT), 
but the translation of connection weights into a formula, typically 
through statistical learning, is non-trivial, even for a simple law such 
as this. The second reason is more subtle: the laws that underlie the 
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behaviour of a compound might depend on knowledge that scientists 
do not yet possess, such as a many-body interaction giving rise to a new 
type of superconductivity. If an advanced machine-learning system was 
able to learn key aspects of quantum mechanics, it is hard to envisage 
how its connection weights could be turned into a comprehensible 
theory if the scientist lacked understanding of a fundamental compo- 
nent of it. Finally, there may be scientific laws that are so complex that, 
were they to be discovered by a machine-learning system, they would 
be too challenging for even a knowledgeable scientist to understand. A 
machine-learning system that could discern and use such laws would 
truly be a computational black box. 


As scientists embrace the inclusion of machine learning with 


statistically driven design in their research programmes, the number 
of reported applications is growing at an extraordinary rate. This 
new generation of computational science, supported by a platform of 
open-source tools and data sharing, has the potential to revolutionize 
molecular and materials discovery. 
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Niobium tungsten oxides for high-rate 
lithium-ion energy storage 
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The maximum power output and minimum charging time of a lithium-ion battery depend on both ionic and electronic 
transport. Ionic diffusion within the electrochemically active particles generally represents a fundamental limitation to 
the rate at which a battery can be charged and discharged. To compensate for the relatively slow solid-state ionic diffusion 
and to enable high power and rapid charging, the active particles are frequently reduced to nanometre dimensions, to 
the detriment of volumetric packing density, cost, stability and sustainability. As an alternative to nanoscaling, here we 
show that two complex niobium tungsten oxides—Nb;,W5O355 and NbjgW 6093, which adopt crystallographic shear and 
bronze-like structures, respectively—can intercalate large quantities of lithium at high rates, even when the sizes of the 
niobium tungsten oxide particles are of the order of micrometres. Measurements of lithium-ion diffusion coefficients in 
both structures reveal room-temperature values that are several orders of magnitude higher than those in typical electrode 
materials such as LigTis;0)) and LiMn 0,4. Multielectron redox, buffered volume expansion, topologically frustrated 
niobium/tungsten polyhedral arrangements and rapid solid-state lithium transport lead to extremely high volumetric 
capacities and rate performance. Unconventional materials and mechanisms that enable lithiation of micrometre-sized 
particles in minutes have implications for high-power applications, fast- charging devices, all-solid-state energy storage 


systems, electrode design and material discovery. 


New high-rate electrode materials that can store large quantities of 
charge in a few minutes, rather than hours, are required to increase 
power and decrease charging time in lithium-ion batteries. Such mate- 
rials can thus help to alleviate technological challenges associated with 
the adoption of electric vehicles and grid-scale batteries and enable 
the development of new power-intensive devices. The most intuitive 
and commonly used approach to increase rate performance is to create 
nanometre-sized or porous (and often hierarchical) structures, which 
minimize Li* solid-state diffusion distances, enable more rapid Lit 
transport through the composite electrode and increase the surface 
area of electrode materials in contact with electrolyte. Carbonaceous 
hierarchical structures and carbon-coating are also frequently used to 
improve electronic conductivity, which is another prerequisite for the 
application of high current densities. 

In practice, despite excellent lithium mobility, graphite cannot be 
used at high rates owing to particle fracture and the risk of Li dendrite 
formation, the latter leading to short circuits and potentially fires and 
explosions'~°. The dendrite issue inherently limits the use of low- 
voltage anodes in high-rate applications because electrode inhomogeneity, 
or any source of increased overpotential, can lead to Li plating poten- 
tials on the surface of the electrode®. LigTisOy2, with an average voltage 
of 1.55 V (Supplementary Fig. 1), enables high-rate Li (de)intercala- 
tion without the risk of Li dendrites or substantial solid—electrolyte 
interphase formation, albeit with an undesirable but necessary decrease 
in full-cell voltage and thus energy density. In this well-established 
‘high-voltage and high-rate anode, the capacity of 1-\1m particles 
from solid-state synthesis reaches only 60-65 mA h g™! at a rate* 
of 10C, where the C-rate is defined as the inverse of the number of 
hours required to reach a defined theoretical capacity; for example, 
10C corresponds to a 6-min discharge or charge time (see Methods). 
By contrast, through two decades of research, present carbon-coated 
nanoparticles of Li;TisO12 can reach at least 150 mA h g! at 10C>*, 


which corresponds to approximately 0.5 lithium ions per transition 
metal (Lit/TM). However, using nanometre-sized and porous materials 
for electrochemical energy storage applications inherently results in a 
severe penalty in terms of volumetric energy density. Furthermore, 
these carefully designed porous and nano-architectures are time con- 
suming and expensive to synthesize, characterize and manufacture. 
The relevant synthesis methods often result in relatively low yields or 
the generation of large quantities of chemical waste’, while also being 
more susceptible to degradation during electrochemical cycling (from 
processes such as catalytic decomposition of the electrolyte’, mor- 
phological changes that result in loss of nanostructuring? and higher 
first-cycle capacity loss’). 

In this work, we break from the conventional strategy of nanoscaling 
and nanostructuring of electrode materials to overcome poor ionic 
diffusion and electronic properties (found in TiO, and LiyTisO,2 for 
example). We demonstrate that with the appropriate host lattice, none 
of the usual size, structure or porosity criteria are required to achieve a 
practical high-rate battery electrode. Instead, we use insight obtained 
from previous investigations of complex binary niobium oxides, such as 
the low-temperature polymorph T-Nb,Os"', and of superionic conduc- 
tors, such as lithium lanthanum titanate perovskite (LLTO) 22 to identify 
structural motifs that should exhibit favourable Li diffusion properties, 
and thus superior performance, allowing micrometre-sized particles 
to be used at extremely high rates. We show that when multi-redox 
4d and 5d transition metals are used with the appropriate three- 
dimensional oxide structure, we can achieve extremely high volumetric 
energy densities and impressive rates. The bulk compounds studied 
are a series of complex ‘block or ‘bronze-like’ oxide structures (Fig. 1) 
largely comprising corner- and edge-sharing NbO¢ and WO¢ octahedra, 
and both oxides are prepared via gram-scale solid-state synthesis. 
Their unusual electrochemical performance is first illustrated by 
studying large (3-10 jum primary, 10-30 jtm agglomerate) dense 
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Fig. 1 | Crystal structure and particle morphology of NbisW5O55 and 
Nb igW16093. a-c, NbigW5Oss is built from blocks (red rectangles) of 4 x 5 
(Nb,W)Og octahedra, with adjoining blocks forming crystallographic 
shear planes. (Nb, W)O, tetrahedra connect the corners of the blocks. 

a, A view down the b direction of the structure. b, c Electron images of the 


particles of the block structure NbjgW5Oss (Fig. la-c). Even ona 
mass-normalized basis, the lithium storage performance of NbjgW5Os5 
exceeds that of nanostructured versions of the heavily studied 
LigTis012>!3-, TiO2(B)!©!8 and T-Nb2O5'* 7! under similar loading 
conditions. Given the high density of the crystal structure and the high 
tap density of bulk Nb}.W;5Os5 compared with nanomaterials, this leads 
to exceptionally high volumetric performance. We further demonstrate 
the generality of this bulk phenomenon by exploring another new elec- 
trode material, bronze-like Nb1gW160o3 (Fig. 1d-f). 


Complex oxides for energy storage 

Nbj6W5Oss is a metastable member of the system Nb2O;-WO3”, with 
a monoclinic structure composed of subunits of corner-shared octa- 
hedra arranged into ReO3-like blocks that are four octahedra wide by 
five octahedra long and infinite in the third dimension (Fig. 1a)*°. The 
block subunits are connected by crystallographic shear planes along the 
edges and by tetrahedra at each corner, and the structure is expressed 
as (4 x 5); in the notation (m x n),, m and n denote block length in 
units of octahedra and p relates to the connectivity of the blocks, which 
may also be joined in pairs (p =2) or infinite ribbons (p = 00). To the 
best of our knowledge, this is the first reported application of any 
kind for Nbj¢W5Oss since its discovery in 19657*4, NbjgW 1603 is 
orthorhombic, a 1 x 3 x 1 superstructure of the classic tetragonal 
tungsten bronze (Fig. 1d, Supplementary Fig. 2). The superstructure” 
results from partial filling of pentagonal tunnels by -M-O- chains to 
form pentagonal bipyramids, in addition to the distorted octahedra of 
the tetragonal tungsten bronzes. Nbi¢W5Os5 and NbjgW 460g; were pre- 
pared via co-thermal oxidation of pellets of NbO2 and WO). Details and 
alternative synthetic routes are described in Methods (Supplementary 
Figs. 3-7). 

Reaction of NbjgsW5Os5 with lithium (Fig. 2a) proceeds in 
three regions from 2.5 V to 1.0 V, with an average voltage of 1.57 V 
(Supplementary Fig. 1), comparable to the average voltage”® of 1.55 V 
of Li,Tis;012. The three regions, more easily observed in the derivative 
plot (Fig. 2b), are characterized by their slope and are reminiscent of 
the three regions observed in other crystallographic shear structures”, 
such as a second niobium oxide polymorph, H-Nb,0;”°, PNbyQ);””, 
TiNbO,*° and Nb,.WO33°!. When the kinetics were examined over a 
range of current densities from C/5 (34.3 mA g~!) to 60C (10.3 Ag”), 
NbisW5Oss showed unprecedented bulk rate performance (Fig. 2a, b, e) 
in standard electrode formulations (see Methods and Supplementary 
Fig. 8). At C/5, around 1.3 Lit/TM can be reversibly intercalated for a 
gravimetric capacity of about 225 mA hg™!. When the rate is increased 
by a factor of 25 to 5C, Nbis W5Oss maintains a capacity of 1.0 Lit/TM 
(171 mAh g/!). At 20C, which corresponds to a three-minute discharge, 


ARTICLE 


=— a 
micrometre-sized particles. d-f, NbjsW 606; is a superstructure of the 
tetragonal tungsten bronze (blue) with pentagonal tunnels (grey) partially 
filled by -W-O- chains that form pentagonal bipyramids. d, A view down 
the c direction, depicting the various tunnels. e, f, Electron images. In a 
and d, the black boxes indicate the unit cells. 


it is still possible to exchange 0.86 Li*/TM and access 148 mAhg |. 
Rate tests on Nbj.W5Oss were performed with a potentiostatic hold at 
the top of charge to ensure a reliable starting point for discharge. To test 
the performance under more demanding conditions, 1,000 cycles were 
measured with fixed galvanostatic discharge and charge conditions of 
10C for 250 cycles followed by 20C for 750 cycles with no voltage hold 
(Fig. 2f). Under these conditions, 0.90 Lit/TM (average 155 mA hg™!) 
were reversibly intercalated at 10C with 95% capacity retention after 
250 cycles on non-optimized or calendared electrodes. At 20C, the 
capacity was 0.75 Lit/TM (average 128 mA h g~') and the capacity 
retention was again 95% over 750 cycles at 20C. 

Excellent electrochemical energy storage was also discovered in another 
niobium tungsten oxide with distinct structural motifs: micrometre- 
scale particles of the bronze-like phase NbjgW 16093 (Fig. 1d-f) 
showed enhanced rate performance and could be cycled at extremely 
high rates (Fig. 2c-f). The average voltage of NbisW160o3 is 1.67 V 
(Supplementary Fig. 1). In terms of gravimetric capacity, NbigW 16093 
stores about 20 mA h g™! less than Nbig6W5Oss at C/5 and 1C owing 
to the higher molar mass of the tungsten-rich bronze phase. However, 
at 20C, Nb1gW 16053 is still able to accommodate a full unit of Lit/TM 
for a capacity of approximately 150 mA h g™!. At 60C and 100C 
(14.9 A g~4), the capacity is 105 and 70 mA h g"|, respectively. 

Other cycling conditions, such as long-term cycling at C/5, and the 
effect of current collectors, which cannot be ignored at high rates*”, 
were examined (Supplementary Fig. 9, 10). As a control, Li||Li symmet- 
ric cells were cycled at current densities corresponding to those applied 
for C/5-100C in Fig. 2 (Extended Data Fig. 1a). The overpotentials in 
the symmetric cell closely match those observed in the electrochemical 
cycling curves of Fig. 2a-d. This suggests that the extremely high rates 
in a bulk electrode approach the limits of Li metal plating/stripping or 
lithium-ion desolvation and transport in carbonate ester electrolytes 
at room temperature; that is, a considerable fraction of the ohmic loss 
during fast charging results from the Li metal and electrolyte rather 
than the complex oxide electrode materials. 


Measuring diffusion coefficients in mixed ionic- 
electronic conductors 

The extremely high mobility of Lit ions in these systems enabled the 
direct measurement of lithium diffusion with the pulsed-field-gradient 
nuclear magnetic resonance (PFG NMR) spectroscopy technique, 
which has previously only been applied to liquids or diamagnetic 
superionic solid electrolytes (Extended Data Table 1). To handle the 
short T> (spin-spin) relaxation times of Li nuclei, which generally 
prevent these measurements in electrode materials because they are 
mixed ionic-electronic conductors, experiments were performed in the 
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Fig. 2 | Electrochemistry of NbigW5Os5 and NbigW 16053. 

a-d, Galvanostatic discharge and charge curves and dQ/dV plots 

(Q, capacity; V, voltage) of bulk Nb},W5Os5 (a, b) and Nb} gW 60s; (¢, d) 
from C/5 up to 100C. e, Rate performance summary based on gravimetric 
capacity. f, High-rate cycling for 250 cycles at 10C, followed by 750 cycles 
at 20C. Dense electrodes of large particles with active mass loading of 

2-3 mg cm? were tested at current densities corresponding to discharge 


temperature range 333-453 K, at which the T> relaxation times were 
longer than at room temperature. Analysis of the data (Extended Data 
Fig. 2 Methods) for Li,Nb,}g6WsOs5 (x= 6.3, 8.4) and Li,Nb1gW 16093 
(x =3.4, 6.8, 10.2) showed lithium transport above 107 m*s~t at 
333 K and 10~* m? s~! at 373 K. Assuming Arrhenius behaviour and 
the extremely low measured activation energies of 0.10-0.30 eV, the 
room-temperature lithium diffusion coefficients are estimated to be 
greater than 1 x 107!? m’s~! for all the materials (Table 1). The dif- 
ferent samples exhibited similar diffusion coefficients and activation 
energies, consistent with the transport measurements made with the 
galvanostatic intermittent titration technique (GITT) (see Methods and 
Extended Data Fig. 1b, c) at low-to-moderate lithium contents. Lithium 
diffusion in both niobium tungsten oxide structures is markedly faster 
than that of Liq,,TisO12 or LiyTiO2 (about 107!°-10~- m? s~!) and is 


Table 1 | Lithium diffusion coefficients, obtained from PFG NMR 


Dy (m? s~4) at Di (m2 s~!) at 


Component 298 K 413K E, (eV) 

Lig. 3NbigW50s55-a (15%) 2.1x 10-14 2.7x 1071! 0.23 + 0.04 

Lig. 3Nbi¢Ws50s5-b (85%) 1.7x 10-18 5.2x10-1!8 0.10 + 0.04 

LigaNby6Ws50s55 1.6x 10-38 6.5x10-8 0.13 + 0.0 

(403 kK) 

Li3.4aNbigW16093 1.x 10-38 2.0x 10-12 0.27 + 0.03 

Lig.gNbigW16003 1.110718 1.8 x 10-14 0.30 + 0.0 

Lij0.2NbigW16093 1.1 x 10718 2.7 x 10-12 0.29 + 0.0 
The lithium diffusion coefficients, D,;, were measured directly at 333-453 K and extrapolate 
from the activation energy, E,, to room temperature, where the relaxation time T2 was too short to 
allow direct measurement. Error estimates for activation energies are derived from the standard 


error of the linear fit. Error bars for the diffusion coefficients are given in Extended Data Fig. 2. The 
two diffusion components observed in Lig 3Nbj¢WsOs5 are denoted as ‘-a’ and ‘-b’ with 15% and 
85% signal contribution, respectively. 
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Cycle index 


times of several hours to tens of seconds. Nbjs5W5Os5 was charged with 

a 1-h constant-voltage step at the top of charge to ensure a comparable 
starting point on discharge; Nb}gW 6093 was cycled without this hold 
step for all measurements and stored over 100 mA h gl at 60C (that is, in 
less than 60 s). High-rate cycling for 1,000 cycles was performed on both 
oxides at 10C and 20C and at a constant current, without a potentiostatic 
step. 


close to that of the best known lithium solid electrolytes (Extended 
Data Table 1). GITT and 7Li PFG NMR spectroscopy results (Extended 
Data Figs. 1b and 2, respectively) indicate that this rapid motion is 
maintained to high lithium contents (>1.5 Lit/TM), and then the dif- 
fusion drops by about two orders of magnitude towards 2.0 Lit/TM. 
These complementary techniques suggest that the inherent range of 
the niobium tungsten oxide electrode materials for high-rate multi- 
redox extends to approximately 1.5 Lit/TM. The diffusion coefficients, 
of the order of 10~!*-10~'3 m? s~!, measured for these materials are 
consistent with the values required to achieve full lithiation of 10-j»m 
particles on a 60C timescale (Supplementary Table 1). 


Redox activity and local atomic structure 

To understand (i) the nature of the charge-transfer sequence 
as a function of lithiation and (ii) the origin of multielectron redox 
behaviour in Nbj.Ws5Os5 and NbjgW160o3, the X-ray absorption 
near-edge structure (XANES) of the Nb K edge and W L edges was 
analysed (Supplementary Fig. 11, Methods). For NbjsW5Oss, oper- 
ando and ex situ Nb K edge XANES spectra show a nearly linear trend 
between the number of electrons (and Li*) transferred and the oxida- 
tion state of niobium, extracted from the shift of the absorption edge 
(Fig. 3a, Extended Data Fig. 3, Supplementary Fig. 12). Similarly, ex situ 
samples measured at the W L,,, edges show a steadily negative correla- 
tion between capacity and edge position; however, there appears to be 
a larger shift in the tungsten absorption edge for the first 0.5 Lit/TM 
inserted (Fig. 3b, Extended Data Fig. 3, Supplementary Fig. 12), indi- 
cating a slight preference for tungsten reduction initially. The situation 
for NbigW16Oo3 (Fig. 3, Extended Data Fig. 4, Supplementary Fig. 12) is 
analogous in that the niobium oxidation state decreases nearly linearly 
and tungsten reduction is slightly favoured initially. However, there is a 
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Fig. 3 | X-ray absorption spectroscopy of NbjgW5Os5 and NbjgW 16093. 
a, b, Oxidation states of niobium (a) and tungsten (b) as a function of 
lithiation, extracted from shifts in the absorption energy of the Nb K 
edge and the W L,, edge. The black dashed line demarcates a slope of —1, 
for reference. Oxidation-state error bars are estimated to be £0.2 on the 
basis of energy data resolution. The operando electrochemical discharge 
profile (‘echem) is included in a. c, Pre-edge integrated peak intensity 


clear preference for multielectron reduction on the tungsten through- 
out the lithiation, with Nb** appearing only beyond 1.5 Lit/TM, 
which may be related to cation occupancy variation on octahedral 
versus pentagonal bipyramidal sites. 

The redox centres Nb®** and W°* in Nb;},W;0s5 are both @° and both 
of these cations located in six-coordinate environments experience 
second-order Jahn-Teller (SOJT) distortions, which give rise to a pre- 
edge feature before the main absorption edge in the Nb K and W L, edge 
X-ray absorption spectra (Extended Data Figs. 3, 4, Supplementary 
Figs. 13, 14, Methods); this serves as a direct probe of local symmetry 
and an additional measure of the oxidation state. As the d° cations are 
reduced, the energy of the d states moves up and the SOJT distortion 
is reduced (Methods), which increases the local octahedral symmetry 
and decreases the pre-edge states and intensity monotonically (Fig. 3c, 
Extended Data Figs. 3, 4, Supplementary Fig. 13)”, again with a slightly 
larger effect on tungsten at low lithium concentrations and a bigger 
difference between tungsten and niobium in Nb}gW 6093. By about 
0.8-1.0 Li*/TM, both the Nb K and W L, distinct pre-edges of both 
the block and bronze phase have decreased and no substantial further 
changes are observed in compositions corresponding to multi-redox; 
thus, lithiation is associated with an increase in local symmetry for the 
d° oxide intercalation hosts. The XANES edge shifts and pre-edge evo- 
lution for Nb}g.WsOs5 and NbjgW160o93 observed in this work indicate 
a parallel reduction pathway and multielectron charge storage of both 
transition metals. 


Anisotropic host-lattice response to Li intercalation 

To probe the lattice evolution of these complex oxides upon lithiation, 
operando synchrotron X-ray diffraction was performed at different 
cycling rates. At C/2, Nbig6Ws5Os5 evolves through a complex, three- 
stage solid-solution mechanism (Fig. 4a, c, Extended Data Figs. 5, 6) that 
correlates with the following observed electrochemical regions: (i) high 
voltage (until about 65 mA h g™! or 0.4 Lit/TM): a-c-plane expansion 
of the blocks, along with a slight expansion perpendicular to the block 
plane; (ii) about 65-170 mA h g! (0.4-1.0 Lit/TM): anisotropic behav- 
iour involving a contraction of the blocks and a substantial expansion in 
the (perpendicular) b direction; (iii) multi-redox (beyond 1.0 Lit/TM): 
linear expansion in all dimensions. The volume expansion is consid- 
erably buffered by the block contraction in the second stage, and the 
lattice undergoes only 5.5% expansion at lithiation to 1.0 Lit/TM. 
The diffraction results show that although the second electrochemical 
process has a relatively small gradient it is not two-phase, in agreement 
with the GITT measurements (Extended Data Fig. 1b, c). Upon charging, 


Li per transition metal 


Li per transition metal 


from the Nb K edge and W L, edge serves as a measure of local distortion 
from SOJT effects on d° octahedral sites and, as the SOJT effect is relaxed 
upon reduction to d! (see Methods for further discussion), a further 
indication of the oxidation state. Pre-edge integrated peak intensity error 
bars represent 20 from Levenberg-Marquardt nonlinear least-squares 
minimization fits (see Methods and Supplementary Fig. 13). 


the stages are reversed (Extended Data Fig. 6k, 1), although there is 
some first-cycle capacity loss. This capacity loss is ascribed, at least in 
part, to residual Li in the structure (Extended Data Fig. 7), rather than 
the usual solid—electrolyte interphase formation, because the final lith- 
ium ions are considerably harder to extract as their removal would lead 
to the formation of insulating domains. At a ten-times-higher rate (5C), 
only the first two stages of lattice evolution are observed (Extended 
Data Figs. 5, 6), which is consistent with the degree of loss of capacity 
at higher rates. In addition, there is more strain and reaction inhomoge- 
neity at 5C, consistent with previous in situ studies that clearly showed 
that at these rates, lithium transport within the electrolyte results in 
internal (electrolyte) inhomogeneities and concentration gradients***4. 
Nevertheless, the mechanism remains solid-solution in nature, which 
is evident from the high inter-peak intensity (Extended Data Fig. 6j). 

Operando diffraction measurements of NbjgW 16Oo3 were performed 
at rates of up to 10C (Fig. 4b, d; Extended Data Figs. 5, 8, Supplementary 
Fig. 15). Like the block phase, the bronze phase shows a complicated 
but reversible nonlinear and strongly anisotropic structural evolu- 
tion upon lithiation (Fig. 4b, d; see Methods for further discussion). 
Interestingly, the b = 3a pseudo-superstructure relationship persists 
until the fourth intercalation region, at >0.5 Lit/TM. From this point, 
the unit-cell volume actually decreases with increasing lithium interca- 
lation and is the same at 1.0 Lit/TM as at 0.5 Li*/TM—a total increase 
of only 2.8% from the unlithiated host. This volume contraction upon 
lithium insertion is phenomenologically related to negative thermal 
expansion*>** or negative linear compressibility*” and may have shared 
origins related to tilting of the polyhedral units (see “Host features and 
design considerations’). The small volume change has implications for 
the suppression of intergranular cracking and long-term cycle perfor- 
mance*®, Unlike in the shear structure, the bronze phase exhibits a 
compositionally narrow two-phase reaction between approximately 
Lig 6NbigW 16093 and Liyo2Nb1sW 16093 (0.2-0.3 Lit/TM); the phase 
transition preserves the Pham space-group symmetry and is charac- 
terized by a-b-plane expansion and contraction of the bronze layers. 
The structural stability of both the block and bronze phases is also 
reflected in the stability of the energy storage capacity over 1,000 
discharge-charge cycles (Fig. 2f). 


Host features and design considerations 

Relative to the parent ReO; structure, the niobium tungsten oxides 
accommodate anion-deficient nonstoichiometry by forming topolog- 
ically distinct condensed phases. This has important consequences 
for Li motion because the intersecting crystallographic shear planes 
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Fig. 4 | Nonlinear structural evolution of NbjsW5O55 and NbjgW 16093 
from operando synchrotron diffraction. a, b, Operando diffraction 
pattern sections of Nbjs5W5Os5 (a) and Nb;sW 60o;3 (b), with some of 

the strongest reflections labelled and the electrochemistry overlaid. 

0, diffraction angle; I, intensity. c, Upon lithiation, block-phase Nb;,.WsOss5 
evolves in three stages in the range 3.0-1.0 V at C/2; the expansion is 


(block phase) or twisted octahedra locked to pentagonal columns 
(bronze-like) (Fig. 1a, d) decrease the structural degrees of freedom. 
Both the block (NbigsW50s5) and bronze (NbjgWj60o3) structural 
motifs are derived from the parent ReO3 structure type, of which 
WO; is a locally distorted analogue and LLTO is a version with 
Lit and La?" cations on the A site (Supplementary Fig. 16). In 
ReO3, WO3 and LLTO, Li ions occupy 4- and 5-coordinate sites and 
diffuse through square planar window transition states. It is 
well established that this process is fast; an activation energy of 
120-220 meV is found in LLTO, whose framework is stabilized 
by the large lanthanum ions*’~”’. By contrast, although ReO; and 
WO; should also be suitable for rapid three-dimensional lithium 
motion, their open and flexible framework can undergo a structural 
phase transition that locks the Li* in place, reducing Li mobility 
considerably”. 

The shear and bronze structure formation mechanisms result in 
frustrated polyhedral networks that can no longer undergo tilts upon 
lithiation that would clamp the lithium ions, preventing their motion. 
These concepts are well established in perovskites, where the so-called 
rigid-unit modes drive long-range cooperative distortions despite 
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buffered by block contraction in the second stage. d, Bronze-like-phase 
NbigW60o3 evolves through six stages upon lithiation in the range 
3.0-1.0 V at 1C. In this complex oxide, volume expansion is buffered by 
layer contraction in the second stage (two-phase) and b-axis contraction in 
the fourth and fifth stages. Alternating white and purple shading serves as 
a guide to the eye to demarcate reaction stages. 


the framework structure**”. By contrast, crystallographic shear and 
twisted, locked octahedra (Supplementary Fig. 16) lead to two funda- 
mentally different—but effectively three-dimensional—conduction 
networks for lithium, while stabilizing the framework and precluding 
substantial Li-induced rigid-unit mode displacements that lead to local 
and long-range structural rearrangements. 

Structural analyses and bond-valence energy landscape calcu- 
lations (Extended Data Fig. 9a, b) indicate that infinite lithium 
diffusion in the NbjgW5Oss block phase is one-dimensional along the 
b axis, but the twelve parallel tunnels act as a metaphorical multi-lane 
highway, enabling lithium to change ‘lanes’ via local hops in the a-c 
plane. Lithium sites and diffusion in this twelve-channel path may be 
analogous to those of the ReO3 and LLTO structure types (Extended 
Data Fig. 9c), but without the phase transition of ReO; or the blocking 
La** ions of LLTO. Furthermore, given this mechanism, NbisWs5Os5 
should not be susceptible to the tunnel-blocking defects that hinder 
micrometre-sized one-dimensional conductors, such as LiFePO,***°. 
The NbjgW460o3 bronze-like phase has an infinite two-dimensional 
lithium pathway similarly to high-rate T-Nb2Os in the a—b crystallo- 
graphic plane, with the additional benefit of four- and five-membered 


© 2018 Springer Nature Limited. All rights reserved. 


ARTICLE 


a b 
600 5 TTT T TTTTT T T T prriy TT TTT TT TITTY TT TTT TT TTT 
L HB 1C rate (60-min discharge) sg8 Fa e 
500 1S 20C rate (3-min discharge) E * e q 
A ' Kk AA 1 
é | [ > x Aa, wi} 
£ [ a a 
<x 400 qoan 
> i ig td * > R | 
8 7s F Vvy | 
2 300 4 = a YY 
° = aoeh 4 
2 J eons vV a 
= 200 | 2 [fr Nb gW,Oeg (3-10 um) @ : 
3 4 2 + @ Nb,gW,gOog (3-10 um) | 
a 106 || [ @ Li,Ti,0,5 (100 nm/1 um) 1 
b> Li,Ti,O,./C (100 nm/1 um) | 
| A T-Nb,O,/holey-graphene (15 nm, pressed) 
aL : 4 
VLDGEVDDDOGVOVIOGDHEDRDELFEEDDOUODDELCEEERDSED DO 10°E W Ti0,(8) (6 nm NTs/rGO, graded) ; 
S522 ESSSSSSese ess ZSSS9SSSQ ECS G aS52 L 4 Graphite (6-44 um, magnetically aligned) a 
= Py errr: _ r 7 
aoEESEEESTESSA=ESESESESELE SE SECS ETPETECE Cool pe et el al 
SES SSE ER PG z= EES TEE EES FEES OO Ge SOESES 10! 10? 108 104 108 
SSE BOCrBSnSEGEBESEMSSHUBHAGZEDREEEGLOS S F “4 
FERRE ORES BES EPS HEsLes Ve eSVelowrasSeeega Power density (W |") 
2S Se OSS GE 5 2A ZB WAT EE OO E OD PSGE EER BG OSS 
SOF ES EER ES GE RE ud 220 So OAZoO BG GHOLSVIRzS 
TT eos soo SH ZGe GE SSI nesses oss PPSZED 
ene segs sorAzAGEeEemEOP er Mert e222 Woz @eOSES 
Se BS SBSSEPEES SFOS MREEEQFFFEOSEFO Ectgor 
88er 20s Ef Por PRAIA H- os -SP 525 
oKoy fEE oeZ= 95 am Ze Z FES 2 of 
=o 7: pee a rs FOF +8 BSS 
oF, Q556o = > Yo §05 
ES ago = O xo) | o 
Sa oOYF * ° 2s any 
4 Fre S 22 5 
oO a fo) $s O 
F = st 5% 
Zz 6 in 
E 3 


Fig. 5 | Charge, energy, and power densities of micrometre-sized 
Nbi¢6W50s5 and NbjgsW 16093 compared to state-of-the-art high-rate 
nanoscale materials and formulations. a, Nb}gW;5Os55 and NbigW 16093 
charge-storage results from this work are compared to high-rate electrode 
formulations from the literature (see Supplementary Table 2). Volumetric 
capacities at 1C and 20C are determined from reported capacities and 
from reported, measured or estimated tap densities (Supplementary 
Table 2). The sizes of the primary and, where appropriate, secondary 
particles or pores are indicated. NP, nanoparticle; NW, nanowire; rGO, 


windows and channels along the c axis to increase the dimensionality 
of long-range diffusion to three dimensions. As a result, the structure 
appears to exhibit better performance, certainly under comparable 
morphologies and conditions, than the recently intensely studied 
T-Nb2O5!!"!9?85152. The bronze comparisons stress the complemen- 
tary benefits of the open room-and-pillar host structure found in 
T-Nb2O5!! and Nb ;gWj60o3 and the additional perpendicular lithium 
diffusion channels in NbjgW 60o3. In addition, both Nb},.W;Os5 and 
NbigW160o3 may benefit from the randomly distributed Nb and W 
occupancy throughout the metal sites, which prevents rate-inhibiting 
long-range lithium ordering(s). NbjgW5Os55 and NbigW460o3 are 
compared to their binary counterparts H- and T-Nb2Os, respectively, 
as a function of capacity and Lit/TM (Extended Data Fig. 9d-g). 
Although the niobium tungsten oxide hosts are d° insulators, their 
three-dimensional -M-O- connectivity enables effective electronic 
pathways because the structure is n-doped upon lithiation. In addition, 
variations of the bronze and crystallographic shear structure types are 
abundant and tunable for specific applications and beyond Li electro- 
chemical energy storage (see Methods for further discussion regarding 
structure family extensions). 


Volumetric energy density and future applications 

When compared strictly on the basis of the theoretical 1.0 Lit/TM 
reaction and crystallographic density of the active material, 
titania, niobia and graphite could all display charge densities of 
greater than 800 A h1~' (Supplementary Fig. 17). Once experimental 
capacities, multielectron redox and tap density are considered (Fig. 5a, 
Supplementary Table 2), the bulk, unoptimized niobium tungsten 
oxides presented here maintain volumetric charge storage densi- 
ties greater than 500 Ah 17’ at 1C and up to 400 A h1! at 20C— 
performance that even the most optimized versions of TiO2, Nb2O5 


reduced graphene oxide; NS, nanosheet; NT, nanotube; CDC, carbide- 
derived carbon; CNT, carbon nanotube. b, Ragone (double logarithmic) 
plot of energy density and power density for anode active materials 
versus a 4.0-V cathode (Methods and Supplementary Table 2, including 
relevant literature references). The mass loading of the niobium tungsten 
oxides was 2.6 mg cm~” and that of the other titanium- and niobium- 
based materials was about 1 mg cm~?. Graphite is included as a reference, 
although it suffers from Li plating risks and particle fracture when used at 
high rates. 


and Li,Tis0,2 cannot achieve. This does not mean that the compounds 
presented here cannot be improved by methods such as nanostruc- 
turing, calendaring or carbon coating (see, for example, ref. °”), but 
proves that large, micrometre-sized particles can be used for high-rate 
electrodes and illustrates that nanoscaling is not always the most appro- 
priate strategy to improve performance. This is evident in a Ragone 
plot (Fig. 5b), which shows the higher energy and power densities of 
the new bulk niobium tungsten oxides compared with literature values 
for state-of-the-art high-rate anode materials and with graphite (see 
expanded dataset including other electrode materials, details and 
references in Supplementary Table 2). 

The ability to intercalate lithium into microscopic particles in min- 
utes, with transport numbers approaching the best liquid and solid 
electrolytes, calls for a different paradigm for electrode structuring. 
Rather than focusing on particle dimensions, other aspects—such as 
electrolyte transport or counter electrode diffusivity and overpotential— 
may become more critical to enhancing performance. The strict 
requirements for carbon coating and intricately wiring nanoparticles 
are relaxed and issues stemming from surface reactivity and stability are 
also diminished. Other applications of these materials can be envisaged. 
For example, in the field of all-solid-state batteries, there has been a 
mismatch between lithium transport in superionic solid electrolytes 
compared with electrode materials. If both components have similar 
diffusivities, it may be possible to design and implement new, simplified 
composite structures to speed up the realization of safe all-solid-state 
energy storage. 

In conclusion, extremely high-rate performance has been achieved 
without nanoscaling by identifying appropriate three-dimensional 
oxide crystal structures. The two new electrode materials, NbjgW5O0s55 
and NbjgW16Oo;, effectively use superstructure motifs to provide stable 
host structures for lithium intercalation with facile and defect-tolerant 
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lithium diffusion and multielectron redox. Volume expansion is miti- 
gated by structural contraction along specific crystallographic axes in 
response to increased lithium content, which may enable the extended 
cycling of these large particles*’. The materials investigated here operate 
in a similar voltage region to the well-studied, and generally consid- 
ered to be ‘safe; anode materials LigTis0j2 and TiO.(B). In contrast to 
approaches that try to overcome physical properties (such as ionic and 
electronic conductivity) extrinsically (for example, by nanoscaling and 
carbon coating), the high-rate, high-capacity properties of the niobium 
tungsten oxide block and bronze phases presented here are intrinsic 
to the complex atomic and electronic networks. The path forward for 
new fast ionic conductors should consider host structures with open 
yet frustrated topologies (that prevent structural rearrangements that 
reduce Li transport) and that also contain ‘disorder’ in the sense of a 
multitude of guest sites for Li* and limited interaction between the host 
and guest (that is, no strong coupling between the diffusing Li* and 
associated electron, as found in LiFePO,°4, or between the Li* and the 
host structure itself). This leads to a relatively flat potential energy sur- 
face with small kinetic diffusion barriers for Li transport. These criteria, 
which lead to extremely high power and energy densities (Fig. 5b), 
are satisfied in NbjgW5Os5 and NbjgW60o3 via crystallographic shear 
and pentagonal columns that prop open the framework structures and 
through transition-metal disorder that prevents Li ordering. 


Online content 

Any Methods, including any statements of data availability and Nature Research 
reporting summaries, along with any additional references and Source Data files, 
are available in the online version of the paper at https://doi.org/10.1038/s41586- 
018-0347-0. 
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METHODS 

Synthesis and structure characterization. Nbj.WsOss5 and NbigW 16093 were 
synthesized by co-thermal oxidation of dark-blue NbO (Alfa Aesar, 99+%) and 
brown WO) (Alfa Aesar, 99.9%) in batches of approximately 1-5 g. The partially 
reduced oxides were massed to within 0.001 g of the 16:5 or 18:16 stoichiometric 
ratios, ground together by hand with an agate mortar and pestle, pressed into a 
pellet at 10 MPa, heated in a platinum crucible at a rate of 10 K min“ to 1,473 K, 
and naturally cooled in the furnace over approximately 2 h. Both phases formed 
subhedral particles with approximately 3-10-|1m primary particles that appeared 
in the electron microscope to be intergrown/‘cemented’ into larger secondary 
particles of around 10-30 jum (Fig. 1, Supplementary Fig. 3a, b). As synthesized, 
Nb15W50s5 is pale yellow-green and NbjgW 6Ob3 is off-white (Supplementary 
Fig. 3c) (we note that the colours of both mixed metal oxides are between those of 
light-green WO; and white Nb2Os). Energy-dispersive X-ray spectroscopy (EDX) 
revealed a uniform distribution of niobium, tungsten and oxygen, with no apparent 
phase-segregated particles and no presence of any other elements (besides those 
on the background adhesive carbon) (Supplementary Figs. 4, 5). We note that 
although phases such as Nb2O; and WO; would be distinct via EDX, particles of 
niobium tungsten oxide phases with similar composition to that of NbjgW5Oss or 
Nb gW 16093 would not be distinguishable. Although the phase diagram>? suggests 
that NbjsW5Os; is only formed at about 1,363-1,658 K and is a metastable phase 
that should disproportionate to Nb}4W3044 and Nb2WOs on cooling, this was 
not observed. Nbi.W5Os5 retains its high-temperature structure on quenching” 
and even with more modest cooling. In the original synthetic report”, Roth and 
Wadsley also noted that phase-pure Nb}sW5Os5 was obtained even after annealing 
at 1,273 K for one to three days after an initial heat treatment at 1,623 K for 24h. 
To investigate other synthetic methods, samples were also prepared by quenching 
the samples from 1,473 K onto a steel plate, as well as from analogous pellets or 
loose mixed powders of Nb2Os (Alfa Aesar, 99.9985%, H-phase) and WO; (Sigma- 
Aldrich, >99%, \-phase) precursors. Large (20 g) single batches of Nb}sW5Os5 and 
Nb gW 16093 were produced by hand grinding (agate, 5 min) or ball milling (SPEX 
8000M high-energy mixer/mill; zirconia jar with two 5-g zirconia balls, 90 min) 
Nb2O; and WO; powder, followed by heating in a platinum crucible covered with 
a platinum lid in air at 973 K for 12 h and then at 1,473 K for 12 h. The large 
batches were heated at 10 K min™! and the samples were allowed to cool naturally 
in the furnace over about 2 h. Thermogravimetric analysis (TGA, Supplementary 
Fig. 6) revealed that WO; oxidizes to WO; starting at around 700 K and NbO, 
oxidizes to Nb2Os starting at around 550 K. At 1,273 K, the WO, sample had gained 
7.8% mass versus the expected 7.4%, and NbO; had gained 5.8% compared to the 
expected 6.4%. These small differences may arise from slight off-stoichiometry in 
the starting material. WO3 undergoes sublimation at temperatures near or above 
1,273 K; however, the reaction with niobium oxide apparently stabilizes the vola- 
tile tungsten to considerably higher temperatures. After 24 h at 1,473 K, the mass 
change was -0.4% for NbjgW5Os5 and —0.3% for NbigW 16093 compared with the 
expected values, which is within the error of the expected mass changes for NbO2 
and WO); oxidation. 

Laboratory powder X-ray diffraction patterns were measured at ambient 
temperature with a Cu Ka source in Bragg-Brentano geometry at a scan rate of 
0.83° min”!. The sample stage rotated continuously to improve powder averaging. 
Sample phase and purity were determined by Rietveld refinement in GSAS-II 
(Supplementary Fig. 7)°”. Given the synthesis time and temperature (24 h and 
1,473 K), the block-phase NbjsWsOss samples are expected to contain several 
per cent of Wadsley defect fringes, as suggested by the laborious study of Allpress 
and Roth®. 

TGA measurements were performed on a Mettler Toledo TGA/SDTA 851 thermo- 
balance. Samples were placed in a tared 100-11 alumina crucible and the mass 
was recorded from 323 K to 1,273 K in steps of 1 K min! under a constant air flow 
(50 ml min“). A blank with an empty crucible was recorded under the same con- 
ditions and subtracted from the NbO,/WO, data. The raw data were numerically 
differentiated to obtain differential thermogravimetry curves. 

Scanning electron microscopy images were taken with a Sigma VP instrument 
(Zeiss) at 3.0 kV and a MIRA3 instrument (TESCAN) at 5.0 kV with secondary- 
electron detection. 

Tap density was recorded on an AutoTap (Quantachrome Instruments) instru- 
ment operating at 257 taps per minute. Tap densities were measured according to 
the ASTM international standard B527-15, modified to accommodate a 5-10 cm? 
graduated cylinder. 

Electrochemical characterization. All electrochemical measurements were 
conducted in stainless-steel 2032 coin cells (Cambridge Energy Solutions) with 
a stainless-steel conical spring, two 0.5-mm-thick stainless-steel spacer disks, a 
plastic gasket and a glass microfibre separator (Whatman). The electrode had 
an active material:carbon:binder mass ratio of 8:1:1, active material loading of 
2-3 mg cm * and an area of 1.27 cm’. The metal oxide and conductive carbon 
(Super P, TIMCAL) were ground by hand in an agate mortar and pestle in an 8:1 


mass ratio. This powder was ground in a 9:1 mass ratio with poly(vinylidene diflu- 
oride) (PVDE, Kynar) dispersed in N-methyl pyrrolidone (NMP; Sigma-Aldrich, 
anhydrous, 99.5%). Although standard, SuperP carbon is a nanoparticulate powder 
and NMP is a hazardous organic solvent, so appropriate nanoparticle cabinets and 
fume hoods should be used. This metal oxide/carbon/polymer electrode served 
as the cathode against a Li metal-disk (LTS Research, 99.95%) anode in half-cell 
geometry. The electrolyte for all experiments was 1.0 M LiPFs dissolved in 1:1 v/v 
ethylene carbonate/dimethyl carbonate (EC/DMC; Sigma-Aldrich, battery grade). 
No additives were used. Electrochemistry measurements were performed in a 
temperature-controlled room at 293 + 2 K. A galvanostat/potentiostat (BioLogic) 
with EC-Laboratory software was used to perform the electrochemical measure- 
ments. All potentials are reported with respect to Lit /Li?. 

In this work, the C-rate is always defined relative to one-electron trans- 
fer per transition metal; for example, for Nbjs W5Oss5, 1C = 171.3 mA g™! and 
20C = 3,426 mA g”. Theoretical capacity is calculated by: 


nF 21 x 96,485.3 C mol 
3.6m 3.6C mAh! x 3,285.65 g mol ! 


-1 
Qtheoretical = =171.3mAh g 


where n is the number of electrons transferred per formula unit, F is Faraday’s 
constant, 3.6 is a conversion factor between coulombs and the conventional 
milliampere-hour and m is the mass per formula unit. 

Ina Ragone plot”, the energy density of a battery cathode material or full cell 

is the product of capacity (Q) and voltage (V); however, this quantity is not appro- 
priate when comparing anode materials, where energy and voltage have an inverse 
relationship. In the calculation of the anode-material Ragone plot in Fig. 5b, the 
energy (E) is computed on the basis of the voltage difference versus a 4.0-V cathode. 
Thus, normalized to the anode, Eanode = Qanode( Veathode — Vanode). Literature values 
for the Ragone plot were extracted with WebPlotDigitizer™. 
Operando synchrotron X-ray diffraction. Operando powder diffraction meas- 
urements were carried out at beamline 17BM (bending magnet) at the Advanced 
Photon Source at Argonne National Laboratory. The samples were measured at 
ambient temperature in transmission geometry at 17.0 keV (0.7295 A or 0.72768 A) 
with an area detector. All operando measurements were performed in the AMPIX 
cell, which has been described elsewhere’. In brief, it contains a hard, conduc- 
tive glassy carbon window to prevent inhomogeneous electrochemical reactions, 
which are a concern with flexible or non-conductive X-ray windows®!. Owing 
to the absorbing nature of niobium and tungsten and the absence of a current 
collector, self-standing electrodes were made with a 3:6:1 (NbjsW5Oss at C/2) or 
5:4:1 (Nbj6W5Oss at 5C and NjgW460¢3 at 1C, 5C and 10C) mass ratio of metal 
oxide:SuperP carbon:poly(tetrafluoroethylene). The resulting electrodes were 
approximately 1.0 cm in diameter and contained 5.3 mg, 8.7 mg and 8.0 mg of 
active material for Nby6WsOss (C/2), Nbye6WsOss (5C) and NigW 16093 (1C, 5C 
and 10C), respectively. These electrodes correspond to areal metal-oxide mass 
loadings of 6.8-11.1 mg cm~*. AMPIX cells were constructed in an argon glove- 
box with lithium-metal counter-electrodes, glass-fibre separators (Whatman) and 
1.0 M LiPF, dissolved in 1:1 v/v ethylene carbonate/diethyl carbonate (EC/DEC). 
Two-dimensional image data were converted to conventional one-dimensional 
diffraction patterns through integration in GSAS-II after calibration with LaBg’”. 
Background subtraction, primarily from the glassy carbon window, and normaliza- 
tion to incident X-ray intensity were performed on the one-dimensional integrated 
diffraction data. 

Sequential Rietveld refinement of the unit-cell parameters of NbjgW5Oss5 and 
Nb)gW 16093 during operando galvanostatic electrochemical lithiation was per- 
formed in GSAS-II. The initial structure model was based on Rietveld refinement 
of the atomic coordinates, unit-cell parameters, background, zero offset and peak- 
shape parameters (U, Vand W; peak width squared (H”) = Utan’0 + Vtand + W)* 
of the first in situ diffraction pattern before the application of a current. For the 
sequential refinement, only the unit-cell parameters and Chebyshev background 
coefficients were allowed to vary. Refinement of the atomic coordinates was not 
attempted owing to the large number of variables this would introduce, the weak 
relative scattering of lithium and oxygen versus niobium and tungsten, and the 
optimization of experiments towards rapid data collection. NbjgsWsOss contains 11 
metal sites and 28 oxygen sites and Nb;gW 160s; contains 10 metal sites and 25 oxy- 
gen sites in the asymmetric unit. Two-dimensional diffraction images (integrated 
to one-dimensional patterns) were collected in 7-15 s (Supplementary Table 3). 

Nb gW 16093, which exhibits a region of two-phase behaviour, was fitted using a 
double-ended sequential Rietveld refinement, in which the process described above 
was also initiated from the structure at the end of the discharge and a ‘backward’ 
refinement was performed. Error bars (Extended Data Fig. 5) are derived from the 
refinements. The unit-cell parameters in the two-phase region of NbjgW16Oo3 and 
the inhomogeneous solid-solution region of NbjsW5Os5 were estimated by fitting 
individual reflections to two sets of lattice parameters (NbigW 16093) and meas- 
uring the range of individual reflections (NbjsWsOss); thus, error bars in these 


© 2018 Springer Nature Limited. All rights reserved. 


regions cannot be calculated using the whole pattern-refinement method. Owing to 
the unit-cell dimensions and pseudosymmetry, there is severe overlap of reflections 
in the powder diffraction patterns of both NbjgW 16093 and NbygWsOss, even at 
the high resolution available at a synchrotron X-ray source. Furthermore, aniso- 
tropic lattice expansion results in reflections crossing during discharge (Fig. 4), 
which introduces challenges for sequential refinement. At 145-155 mAh g™ in 
Nbj«W50s5, this crossover (Fig. 4a, Extended Data Fig. 6k) resulted in strongly 
correlated and inconsistent fits for four consecutive diffraction patterns. These 
patterns were fitted according to the individual reflection model described above 
assuming linear changes—an approximation over the small change in lithium con- 
tent and unit-cell dimensions. 

For the crystallographic shear structure with monoclinic symmetry 

(Nbi5W5;Os5; Fig. la, Fig. 4c), the b axis represents the ‘layered’ direction and the 
‘block area’ is parameterized by acsin3, where the a and c axes define the plane of 
the block and (3 is the angle between a and c. The unit-cell volume of this phase is 
the product of the block area and the layer spacing, (acsin/3)b. For the bronze-like 
phase with orthorhombic symmetry (Nb}gW16Oo3, Fig. 1d, Fig. 4d), the unit-cell 
volume is simply the product of the crystallographic axes, abc. 
Operando and ex situ synchrotron X-ray absorption spectroscopy. Operando 
X-ray absorption spectroscopy (XAS) was performed at beamline 9BM (bending 
magnet) at the Advanced Photon Source at Argonne National Laboratory. Nb K 
edge spectra were recorded at ambient temperature in transmission mode above 
and below the absorption edge of 18,986 eV. The same AMPIX cells were used as 
for the diffraction measurements, but with a higher metal-oxide loading of 22.5 mg 
and lower carbon content (8:1:1 ratio of oxide:carbon:binder) at C/3. 

Ex situ XAS was performed at beamline B18 (bending magnet) at the Diamond 
Light Source. Spectra of the Nb K edge, W L,, Ly and Ly, edges were measured at 
ambient temperature in transmission mode using energy scans above and below 
the absorption edges of 18,986 eV, 12,100 eV, 11,544 eV and 10,207 eV, respectively. 
The X-ray energy ranges were scanned over 1,000 eV (W L, + Ly) or 1,500 eV 
(Nb K, W L,,,) in steps of about 0.4 eV. To synthesize powders for ex situ characteri- 
zation, pure NbjsW5Oss powder was pressed into pellets at 1-2 MPa, discharged to 
the relevant lithium content in coin cells as previously described, then extracted in 
an argon glovebox where it was rinsed three times with 2 ml DMC (Sigma-Aldrich, 
anhydrous, >99%) and hand-ground (agate) back into powder. Upon lithiation, the 
colour of NbjsW5Oss changed from a pale yellow-green to light blue, then to dark 
blue, and through to dark maroon above approximately 0.5 Li*/TM. Samples for 
XAS were prepared in an argon glovebox by carefully grinding the active material 
(typically 15-18 mg) with dried cellulose (to a total mass of 60 + 5 mg) in an 
agate mortar and pestle to achieve a fine homogeneous sample with a change in 
absorption coefficient of about 1 across the Nb K and W Ly, absorption edges. 
The thoroughly ground powders were pressed into 8-mm-diameter pellets (about 
1 mm thick) in the glovebox. Non-air-sensitive samples (pristine and reference) 
were measured in air and lithiated samples were transferred from the glovebox in 
a custom-built (Diamond Light Source) transfer chamber with X-ray-transparent 
windows. The argon in the chamber was flushed with helium to reduce back- 
ground absorption of the incident X-rays and the samples were measured at a 
slight overpressure of helium to ensure the exclusion of air. Multiple spectra were 
collected from each sample to improve the signal-to-noise ratio and to ensure 
sample stability with respect to possible air contamination or beam damage. The 
oxides appeared to be stable; no changes were observed for any sample between 
the first and last measurement (typically 2-6 spectra per sample per absorption 
edge) over several hours in the sample chamber and 30-60 min in the X-ray beam. 

For all XAS measurements, simultaneous measurement of a reference Nb or 
W foil was performed with each spectrum to ensure X-ray beam stability and 
for energy calibration. Data merging, calibration/alignment, analysis and peak 
fitting were performed within the Athena program in the Demeter package run- 
ning IFEFFIT®*!, The Nb K edge, W L,, Ly and Li edges were calibrated to the 
reference Nb and W metal foils by setting the first maximum in the derivative plot 
of absorption versus energy equal to the standard electron binding energy®. The 
NbisW5Oss operando calibration was set to be equivalent to the ex situ calibration 
to simultaneously compare edge shifts of both datasets. 

XAS of the Nb K edge probes excitations from 1s to 5p at energies around 
19,000 eV, which is well suited for standard synchrotron analysis. In principle, 
the same type of analysis could be performed with excitations from 1s to 6p at the 
W K edge, but at 69,500 eV, it is impractically high in energy for standard beam- 
line configurations and suffers from severe core-hole lifetime broadening®. The 
W Ledges, however, at 12,100-10,200 eV are suitable for high-resolution XAS. 
The W L, edge measures 2s to 6p excitations whereas the W L,, and Ly, edges 
correspond to transitions of 2p to (near-)valence 5d states. The L,, and Ly, edges are 
split by about 1,337 eV by spin-orbit coupling; in the octahedral sites present in 
Nb sWsOs5s, these edges are further split by about 4 eV by ligand-field effects into 
transitions to tz, and eg orbitals. Recent studies have demonstrated the use of the 
5d L, edges for oxidation-state analysis in periodic!! and molecular®”® structures. 
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We considered this; however, the edge position of the W L, edge in NbjsW5Oss5 
is severely complicated by the strong pre-edge feature. In this system, the Ly edge 
offered an excellent alternative, with a stronger signal and a single contribution 
from excitations from 21pj/2 to 5d3/2. We note that the contribution expected for 2p 
to 6s transitions can be neglected at the absorption edge for early-third-row transi- 
tion metals®. Meanwhile, the W L, pre-edge peaks that are problematic for energy 
calibration provide direct evidence for local geometry around the tungsten ions. 

An SOJT distortion moves Nb** and W°* off-centre in the otherwise centro- 
symmetric environments of the octahedral MO, transition-metal sites in 
NbjsW5Os5 and Nb}gW 60o3. The SOJT results from the vibronic mixing of non- 
degenerate electronic states under nuclear displacements”°. Ina d° metal oxide, 
the energy difference between the ground and excited states is small owing to the 
pseudodegeneracy of the filled valence p orbitals and empty d orbitals in the first 
excited state. This off-centre distortion enables mixing of the dipole-transition- 
allowed p states (for example, 5p for the Nb K edge) with lower-energy d states 
(4d for the Nb K edge), which gives rise to a pre-edge feature before the main 
absorption edge in the XAS spectra of the Nb K and W L, edges and serves as a 
direct probe of local symmetry and an additional measure of the oxidation state. 
As the d® cations are reduced, the energy of the d states moves up and the SOJT 
distortion is reduced, which increases the local octahedral symmetry and decreases 
the pre-edge states and intensity. X-ray absorption pre-edge features are well known 
in tetrahedral and other noncentrosymmetric local-coordination environments, 
so the SOJT distortion and d° electronic configuration are not prerequisites for 
pre-edge peak intensity from those sites. Nbjs.W5Oss has 40 distorted octahedral 
sites and 2 tetrahedral sites per unit cell whereas NbjgW 60o3 has 30 octahedral 
sites and 4 occupied pentagonal bipyramidal sites per unit cell, so some pre-edge 
intensity may be expected, even beyond lithiation to the d' electronic configura- 
tion. Interestingly, it has been shown that the tetragonal bronze structure cannot 
be constructed from identical, regular octahedra”!, so some local distortion is 
topologically required regardless of the oxidation state or SOJT. 

We note that for the related shear-phase Nb}2WO33, the three observed electro- 
chemical regions have tentatively been assigned to the sequential reduction of 
wet!s+, Nb*+/4+ and Nb**/3+ on discharge, but these proposed oxidation changes 
have not been experimentally verified*)”. 
7Li PFG NMR spectroscopy. ’Li NMR diffusion spectra were recorded on a Bruker 
Avance III 300 MHz (7.0 T) spectrometer using a Diff50 probe head equipped 
with an extended variable-temperature 5-mm single-tuned ’Li saddle coil insert. 
All of the materials examined here had spin-lattice (T;) relaxation times greater 
than spin-spin (T>) relaxation times. Therefore, spectra were recorded with the 
stimulated-echo PFG sequence shown in Extended Data Fig. 2b to minimize T, 
losses. After the first 90° radiofrequency pulse, the net magnetization loses coher- 
ence owing to T> relaxation; thus, the time period following this pulse, which 
includes the first PFG pulse (to encode spin position), must be shorter than T). In 
the stimulated-echo sequence used here, a second 90° radiofrequency pulse was 
applied to store the net magnetization along the z axis, allowing diffusion times, 
A, commensurate with the comparatively longer T; value, as no T> relaxation 
occurred. During A, a short spoiler gradient (SINE.100) was applied to remove 
residual transverse magnetization. Afterwards, a third 90° radiofrequency pulse 
was applied, followed by a PFG pulse to decode the spin position. Sufficiently long 
delays (1 ms) were used between PFG and radiofrequency pulses to minimize eddy 
currents in the diffusion measurements. 

During this sequence, the gradient strength, g, was varied from 0.87 to 1,800 
or 2,300 G cm], and 16 gradient steps were acquired using ‘opt’-shaped pulses 
with 1,024-4,096 transients. The opt shape is a composite pulse that starts with a 
quarter of a sine wave, followed by a constant gradient, and ends with a ramp down 
(Extended Data Fig. 2b). The opt gradient pulses provided the largest gradient 
integral for a given time period, maximizing the range of diffusion coefficients 
that we could assess in this experiment. 

Spectra were analysed in phase-sensitive mode and the response of the inte- 
grated NMR signal intensity, I, to variation in gis described by the Stejskal-Tanner 
equation”?: 

I 20262 6 
i, =exp|—g°y"6 [4 au 


where Ip is the intensity in the absence of gradients, 7 is the gyromagnetic ratio 
(711= 103.962 x 10° s*! T"), 6 is the effective gradient pulse duration and D is 
the diffusion coefficient. Typical 6 values ranged from 0.8 ms to 1.5 ms and A 
values were 50-150 ms for the bronze- and the block-phase samples, respectively. 

Diffusion spectra were recorded at elevated temperatures owing to the 
increase in T> observed at high temperature (for example, T> for Li3.4Nb1gW16Oo3 
is approximately 700 1s at room temperature and 1.9 ms at 453 K). (No attempt 
was made to calibrate the temperature for this experimental setup because a 
single-tuned ’Li coil was used and no reliable ’Li reference is routinely used for 
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temperature calibration. The Bruker manual states that for static measurements, 
the temperature calibration should be within +7 K of the set value.) The increase 
in T> allowed the use of longer gradient pulses, 6, that were necessary to measure 
diffusion coefficients in the solid oxides. 

Representative ’Li diffusion decay curves for the five samples studied, 
LiyNbigWsOss (x =6.3, 8.4) and LiyNbigW 6003 (x =3.4, 6.8, 10.2), and a repre- 
sentative one-dimensional ’Li NMR spectrum are shown in Extended Data Fig. 2. 

Concerning the difficulties associated with measuring self-diffusion coefficients 
with PFG NMR spectroscopy, it is worth noting that even if the T, of these mate- 
rials was sufficiently long, the cumulative length of 6 applied before A in the pulse 
sequence of choice (Extended Data Fig. 2b) should not exceed 5 ms to mitigate 
probe damage. The limitation of cumulative 6=5 ms places an inherent limitation 
on the diffusion coefficients that can be extracted using PRG NMR spectroscopy 
and effectively places the burden on the length of T), which determines the length 
of A. 

The PFG data were initially fitted with a single exponential to extract the dif- 
fusion coefficients. In the case of the block phase, Lig 3Nb15W5Oss, fitting the data 
with two components for biexponential decay resulted in an improvement (on 
average) of a factor of two in the residual sum of squares of the fit to the PFG 
data. Therefore, the 7Li NVR signal of the Lis 3NbisWsOss sample presented in 
Table 1 can be interpreted as consisting of two Li species, one that diffuses rapidly 
and one that diffuses more slowly (we note that both diffusion coefficients are 
markedly faster than those reported for LiyTis;O012 and LiFePO,). For the higher-Li- 
content block phase, Lig. 4Nbj5W5Oss, the data were fitted well by a single-component 
exponential that yielded a similar diffusion coefficient and activation energy 
to the majority lithium phase in Lig 3Nb;>W5Os5. This suggests that within the 
two-component model, the fastest diffusing phase is present at lower lithium 
concentrations, and by the second stage of the electrochemical profile the structure 
is composed entirely of the phase with room-temperature lithium diffusion of 
about 1.6 x 10-8 m?s"7. 

For all measured stages of lithiation for the bronze phase (Li3,4Nb1gW16Oo3, 
Lig. gNbigW 16093 and Lijo.2NbigW16Oo3), the data were fitted well by a single- 
component exponential decay (as reflected in the error bars in Extended Data 
Fig. 2a, which represent 20 values obtained from the fit), and no remarkable 
improvement was observed upon fitting by a biexponential. 

Linear fits of the 7Li diffusion coefficients of both the block and bronze phases 
as a function of temperature allow extraction of the activation energy, assuming 
Arrhenius behaviour (Extended Data Fig. 2a). Extrapolation using the activation 
energy allows estimation of Li diffusion at room temperature and facilitates com- 
parison to other materials (Table 1, Extended Data Table 1). 

GITT diffusion coefficient and diffusion length. Information on electrode ther- 
modynamics, including phase transitions, and lithium kinetics” can in principle be 
extracted from GITT measurements by tracking the voltage evolution after a brief 
current pulse as lithium diffuses and the chemical potential equilibrates within the 
electrode/particles. Reliable quantitative diffusion coefficients, D,;, are however 
difficult to extract from GITT alone. To determine a diffusion coefficient from 
GITT measurements, a diffusion length (L) must be defined. However, a battery 
electrode is a heterogeneous system. First, it is a composite of active material (here, 
metal oxide), porous carbon and polymeric binder. Within this composite, there 
is a distribution of particle sizes (unless single crystals” or well defined particles 
are used; even then, diffusion varies with lattice direction). Furthermore, different 
regimes of diffusion must exist because of the presence of solid/liquid interfaces 
and porous electrode structure. Nevertheless, in an electrode that does not undergo 
severe pulverization (for example, an intercalation electrode), L is a fixed quantity 
throughout the experiment. Variation in L—a parameter required to relate the rate 
of relaxation to the diffusion—causes values of Di; to vary considerably between 
reports, even for the same material**”°, Thus, although a physically meaningful 
diffusion coefficient may not be extracted, a relative measure of diffusion is readily 
obtained. For this reason, we use an extracted proxy for lithium diffusion (D,;L~; 
Extended Data Fig. 1), which removes the uncertainty in L and enables self- 
consistent analysis of a single electrode and electrodes prepared under identical 
conditions. The addition of quantitative information from another method (for 
example, NMR spectroscopy or tracer diffusion) allows us to calibrate relative 
changes in Lit kinetics to quantitative diffusion values throughout a range of 
lithiation. 

Nb;gW 6093 operando diffraction mechanism. As shown in Fig. 4d, lithiation 
from 0 to 0.2 Lit/TM is characterized by a—b-plane expansion. The plateau-like 
region from 0.2 to 0.3 Lit/TM is associated with further a—b-plane expansion, still 
maintaining the b ~ 3a superstructure and contraction of the layers (c axis). From 
0.3 to 0.55 Lit/TM, the structure expands nearly isotropically, whereas at about 
0.55 Li*/TM, the pseudo-superstructure relationship collapses. From this point 
until 0.75 Lit/TM, b contracts rapidly, a expands and c is constant. At 0.75 Lit/TM, 
the layers begin to expand back to their initial interlayer spacing while 
b continues contracting, albeit at a slower rate, to its initial length, and a continues 


its expansion. Thus, when fully lithiated to 1.0 Lit/TM (Liz4Nb1gW 6003), b and 
care within +0.1% of the unlithiated host, while a, and thus the total volume 
expansion, is +2.8%. In the multi-redox region beyond 1.0 Li*/TM, the structure 
again expands somewhat isotropically. 

Structure family extensions. Regarding the electronic structure changes of 
NbjsW5Os5 and NbjgW 160g; upon lithiation, related partially reduced phases may 
be considered. Very high electronic conductivities are known to exist in the crys- 
tallographic shear niobium tungsten oxides”’, as well as related partially reduced 
versions of both families, for example, shear Nb)20298 and bronze Ag,3WO3” 
(A =alkali metal cation), which exhibit resistivities lower than 10~? 2 cm. 

For ionic diffusion, there are also analogies between the new lithiated com- 
plex oxides and superionic conductors, such as a-Agl® and LijgGeP2$,25)*, in 
that there are multiple sites for the cations to occupy within a disordered three- 
dimensional structure*>*4, 

In the composition and phase space of block and bronze structures, cation (such 
as early transition metals, p-block elements, aliovalent mixed metals)?”7>77°58° 
and anion (such as oxyfluorides””*”**) dopants make it possible to tune the ionic 
and electronic properties and thereby affect voltage, capacity, power capability and 
stability. Analogues are known with Na*, K*, Mg” and Ca?*, which suggests that 
the variety of possible tunnel shapes and sizes, vacant sites and three-dimensional 
connectivities is also promising for beyond-Li electrochemical energy storage, 
according to the insights into high rate and capacity discovered here. 
Application considerations. Fast charging or high-power delivery from a full 
cell requires a cathode to match the anode. Although LiFePO, has been used as 
a promising high-rate cathode®?, along with LiyTisO,, both of these electrodes 
have exceptionally flat voltage profiles. The LifePO4-Li,Tis0 2 cathode-anode 
combination provides a constant voltage but presents a serious challenge in terms 
of battery management systems (BMS). Simple and accurate BMS are crucial for 
battery applications in electric vehicles and mobile technology and are even more 
important at high rates for preventing dangerous and degradative over(dis)charg- 
ing while maximizing utility. BMS rely on their ability to measure the state of 
charge, which cannot be done by charge counting alone as the battery degrades. 
The open-circuit voltage is a more reliable measure and a thermodynamic quantity. 
In contrast to the LiyTis012/LiFePO, battery, the sloping voltage profiles of the 
new high-rate materials presented herein provide an opportunity for the mod- 
elling and electrochemical engineering/industrial communities to develop BMS 
based on sloping voltage profiles, which may prove to be a substantial commercial 
advantage for NbjsW5O55, NbigW16Oo3 and related materials in the growing area of 
high-power/fast-charging applications. 

High-voltage anode materials, such as the niobium tungsten oxides, have several 
further advantages. Above 1.0 V versus Li*/ Li®, the formation of solid-electrolyte 
interphase is minimal, which means that Li will not be lost into side reactions 
with the electrolyte. Furthermore, in a full cell against, for example, LiFePOu, 
LiN(CF3SO2), can be used to replace the more toxic LiPF, electrolyte salt without 
degrading Al current collector(s)*’, and Al can be used as the current collector 
instead of the more expensive Cu while avoiding LiAl alloying potentials (<0.3 V 
versus Li*/Li)”*. Another incentive to find replacements for nano-LiyTisO49 is 
the issue of gas evolution and associated swelling or pressure build-up?)””, which 
stems from heterogeneous catalysis between the metal-oxide surface and the 
organic electrolyte. The small particle sizes required to compensate for poor Lit 
and electronic diffusion in LiyTi;O 1 increase the reactive surface area, and the 
much smaller surface area of the ~5-j1m Nbj6WsOs5 or NbigW 6093 particles 
potentially reduces side reactions. 

Data availability. The data that support the findings of this study are available from 
www.repository.cam.ac.uk and the corresponding author upon reasonable request. 
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Extended Data Fig. 1 | Overpotential in Li||Li symmetric cells and 
GITT of niobium tungsten oxides. a, Li||Li symmetric cells were 
configured identically to those used for metal-oxide testing, with the 
exception of a second Li disk replacing the composite electrode. Rate 
testing was carried out with current densities corresponding to the rates 
shown in Fig. 2a-e, with 5 cycles at 100 1A (C/5), 500 tA (1C), 1 mA 
(2C), 2.5 mA (5C) and 5 mA (10C) and 10 cycles at 10 mA (20C), 20 mA 
(40C), 30 mA (60C) and 50 mA (100C). The ‘rates’ in parentheses indicate 
the inverse of the time (in hours) that the current was applied, simulating 
the conditions (current densities, periods of applied current and thus 
total charge transferred) of the rate test experiments. An excerpt of the 
results is shown here from the fifth cycle at C/5 until the end of the test; 
the test was performed twice with the same overpotentials observed in 
both cells. At low current densities, below 1 mA (2C), the overpotential is 
below 100 mV; however, at 5 mA (10C) the overpotential rises to 200 mV 
and increases to about 700 mV at 100C. b, c, Relative changes in lithium 
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diffusion as a function of open-circuit voltage (V,.; b) and open-circuit 
voltage versus closed-circuit voltage (V.,; c) from the GITT measurements. 
The plot in c shows the ‘thermodynamic electrochemical profiles at 

C/20 with a 12-h rest period at each point, reaching a full discharge in 
approximately one month. In Nbjs5W5Os5s, the fastest diffusion is observed 
from the dilute limit to Lig. 5(5)NbigWsOss, dropping by two orders of 
magnitude in the low-voltage window, where more than 1 Lit/TM are 
incorporated. The GITT data indicate that the second electrochemical 
region of Nb}sW5Oss is broader than typically observed in a two-phase 
reaction, but the observed discontinuity in the Dj;L~? values in this region 
suggests that Nb}s,W5Os5 approaches two-phase behaviour. The average 
diffusion coefficient in Nb igW 16093 is similar to that of Nb ieW5Os5. The 
bronze also displays discontinuities in D,,L~? at 2.1, 1.85 and 1.7 V. In 
both phases, the low-voltage region (below 1.25 V, well over 1 Lit/TM) is 
characterized by an increasing overpotential and suppressed kinetics. 


© 2018 Springer Nature Limited. All rights reserved. 


ARTICLE 


1E-9 
O / m Li, Nb,,W.O,. 
ae m Li, Nb,,W,O,, 
o . 
= 1E-10 - Li, .ND, W605 b ae er 
os < Li,,Nb,,W,.0,, 
c 
| 

3 : Mh 
 1E-11 HIT 
S ‘Li ws 
6 —_? 
o 
£ 
iS 1E-12 
— 

1E-13 


22 24 26 28 
1000-T ' (K’) 


Li,,,Nb,,W,,O,,. T = 453 K Li, Nb,,W,,O,,, T = 453 K Li, ,Nb,,W,,O,,, T = 453 K 
0 500 1000 1500 2000 2500 0 500 1000 1500 2000 2500 i 0 500 1000 1500 2000 2500 
Gradient Strength (G-cm ') Gradient Strength (G-cm') Gradient Strength (G-cm'') 
h 
Li, |Nb,,W,O,,, T = 353 K Li, |Nb,,W,O,,, T = 383 K 
0 500 1000 1500 2000 2500 0 500 1000 1500 2000 2500 209 400 0 100 200 
Gradient Strength (G-cm'') Gradient Strength (G-cm’') 3 “Li (ppm) 


Extended Data Fig. 2 | See next page for caption. 
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Extended Data Fig. 2 | Lithium diffusion from ’7Li PEG NMR 
spectroscopy. a, The lithium diffusion coefficients of Liy.NbjsWsOss5 

(x= 6.3, 8.4) and Li,Nb;g3W 160o93 (x = 3.4, 6.8, 10.2) were measured 

in the temperature range 333-453 K (Table 1). The filled (85% signal 
contribution) and empty (15% signal contribution) symbols for 

Lig 3Nbis5W5Oss correspond to the observed two-component diffusion (see 
main text and Methods). In most cases, the error bars (20, obtained from 
the fit) are smaller than the sizes of the points. b, Stimulated-echo PFG 
sequence used to measure ’Li diffusivities, showing both radiofrequency 
(Li) and magnetic-field-gradient (G,) pulses. Here, the gradient pulse 
duration (t,) includes the up-ramping, time on and down-ramping of the 
opt composite gradient pulses. During A, a short spoiler gradient was used 
to remove residual transverse magnetization. c-g, Representative “Li decay 
curves showing the normalized NMR signal intensity as a function of 


gradient strength for the bronze structures Lijo 2Nb1gW160b3 at 453 K (c), 
Lig gNbigW 16093 at 453K (d) and Li3.4Nb 1g W 16093 at 453K (e) and 

the block structures Lig 3NbigW5Oss5 at 353 K (f) and Lig 4Nb}gW5Os5 

at 383 K (g). Black circles represent experimental data points and red 
lines represent mono- (c-e, g) or biexponential (f) fits to the data by 

the Stejskal-Tanner equation. Biexponential fits to all samples except 
Lig 3NbisWsOss did not lead to improved fits over those obtained with 
monoexponential fits. The poor signal-to-noise ratio of Lig 3NbisW50s5; 
even with more than 100 mg of electrode sample, did not allow us to 
explore alternative models beyond the mono- and biexponential models, 
such as models considering the effect of anisotropic diffusion, which 
manifests as subtle differences in echo attenuation”. h, A representative 
one-dimensional 7Li NMR spectrum (Lig 4Nb;¢W5Oss, static, 403 K) 
showing the shift (6 Li) region and lineshape. 
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Extended Data Fig. 3 | Nb}; W5O;5 X-ray absorption spectroscopy. 

a-d, Li,Nbj5W5Os5 ex situ (see Supplementary Fig. 14) Nb K edge 
XANES spectra (a, b) and derivative spectra (c, d). e, f, Operando Nb 

K edge XANES spectra of NbjsW5Oss and 22 discharge spectra, with each 
successive spectrum at about +11 mA hg! (e), and operando derivative 
spectra (f; 23 spectra shown, 5 colours labelled). The pre-edge and main 
edge are at about 18,991 and 19,004 eV, respectively. Spectra in b and d are 


58 


19000 19010 


Energy (eV) 


—Li 


leaNby, 


11550 
Energy (eV) 


11550 


Nb, ,W,0, 


1y7Nb,W.055 


m4 aND,,W.0.5] 


Nb ,.W,0, 
Nb, .W,O, 


19020 


11560 


b c 
os ot a] 
3.0 - 
2.55- 
¢ 
2 r 
2 Wy 
a S 
a oy 
2 g 
3 1.5 = 
4a iS 
g 10 4 z 
5 x] 
= 05 4 
0.0 
18980 18990 19000 19010 19020 
Energy (eV) 
e 
4.5 L NP uW.0r5 
—— operando 4 
—— operando 8 
§ — operando 12 
= —— operando 16 
= —— operando 20 
g 1.0 
<x 
se] 
oO 
N 
£05 
S 
z 
0.0 
18980 18990 19000 19010 19020 
Energy (eV) 
h i 
c 
§ i 
5 8 
< 8 
ae) A 
N iS 
is} 
=z 
0 1 . 1 . 
11540 11550 11560 
Energy (eV) 

k I 
8 8 
3 £ 
%S %S 


0.12 


0.08 


18990 


18990 


11540 


19000 
Energy (eV) 


19010 19020 


19000 
Energy (eV) 


19010 19020 


11550 
Energy (eV) 


11560 


11540 


in p for clarity. 


vertically offset by 0.2 and 0.02, respectively, for clarity. g—1, Li,NbigWsOss 


© 2018 Springer Nature Limited. All rights reserved. 


12140 


11550 
Energy (eV) 


11560 


d(norm. abs.)/dE 


12080 12100 12120 


Energy (eV) 


ex situ W L,, edge XANES spectra (g, h), derivative spectra (i, j) and 
second-derivative spectra (k, 1). Spectra in h, j and 1 are vertically offset by 
0.5, 0.2 and 0.4, respectively, for clarity. m—p, LiyNb1>W5Oss ex situ 


ARTICLE 


12140 
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Extended Data Fig. 4 | NbigW 15093 X-ray absorption spectroscopy. spectra (i, j). Spectra in f, h and j are vertically offset by 0.5, 0.2 and 0.4, 
a-d, Li,NbigW 16093 ex situ Nb K edge XANES spectra (a, b) and respectively, for clarity. k-m, Li,Nb;sW 6093 ex situ W L, edge XANES 
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Extended Data Fig. 5 | Lattice evolution of Nb},W5O55 and Nb;gsW 6093 
upon lithiation. a, b, Absolute lattice parameter values resulting from 
Rietveld refinement of operando diffraction data, analogous to that of 
Fig. 4c, NbigW5Os5 (a) and Fig. 4d, NbjgW160o3 (b). Error bars show 

the standard deviation of each parameter, as estimated from the fits 
(approximately equal to the symbol height in b). Shading distinguishes 
the different structural regions. For NbjgW16Oo3, the second stage 
(two-phase region) contains two sets of lattice parameters. c, Structure 
evolution of Nbjs5W5Os5 as a function of rate. At high rates, the lithiation 
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reaction becomes inhomogeneous (Extended Data Fig. 6) and cannot be 
fitted with a single set of lattice parameters over the whole electrode. The 
shaded grey area for the 5C data corresponds to the range of each unit-cell 
parameter. d, Structure evolution of NbjgW160o;3 as a function of rate. 

The mechanism of Li,Nb1gW 1606; lattice evolution does not appear to be 
strongly rate-dependent. The reaction extends further at lower rates within 
the same voltage range, partly because of a smaller overpotential at lower 
current densities (Extended Data Fig. 1a). 
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Extended Data Fig. 6 | See next page for caption. 
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Extended Data Fig. 6 | Solid-solution structure evolution and 
reversibility of NbjsW5Os5. a—c, Profile evolution of selected reflections 
at C/2 and d, the corresponding electrochemical discharge profile. 

e-g, Profile evolution of selected reflections at 5C and h, the corresponding 
electrochemical discharge profile. a, e, (604). b, f overlapping (110) and 
(111). ¢, g, (10 01) (left) and (609) (right), with smaller overlapping 
reflections. The evolution in each case commences analogously; as 
lithiation increases, the structure evolution at high rate becomes 
inhomogeneous and the contraction of the ac block is not fully realized. 
i, The overlapping (110) and (111) reflections shift smoothly to larger d 
spacing (smaller 20) for the entire sample at C/2, whereas in j, there is 
substantial intensity across a range of 26 values at high rate. This inter- 
peak intensity is shown more clearly in the inset of j. The intensity range 
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represents a range of lattice parameters (Extended Data Fig. 5c) and 

an inhomogeneous solid-solution reaction probably resulting from 
inhomogeneous lithium transport and concentration gradients within 

the electrolyte****. Diffraction patterns are shown from 0 to about 

145 mA hg“! (ascending), which corresponds to the full 5C discharge and 
partial C/2 discharge. k, 1, Structure reversibility over a full electrochemical 
lithiation/delithiation cycle of NbjsWs5Oss at C/2. In k, the (14 09)/(407) 
(left) and (020) (right) reflections are displayed over a full operando 
synchrotron X-ray diffraction discharge—charge cycle from 3.0 to 1.0 V 
with multi-redox (de)lithiation. The symmetry of discharge and charge is 
apparent, although a small amount of lithium remains in the structure 
after charging (Extended Data Fig. 7). 
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Extended Data Fig. 7 | Electrochemical profile evolution and early- 
cycle lithium retention of NbjsW5O55 and NbigW160o3. a-f, There 

is an activation process on the first cycle for NbigWsOss (a-c) and 

NbisW 160o93 (d-f), which extends to a much smaller extent in the next 
several cycles, leading to an increase in intercalation voltage at the first 
‘plateau-like’ feature and a broadening of the dQ/dV peaks (b, ¢, e, f). The 
phenomenon is associated with a retention of lithium in the structure. 
The occurrence of this activation phenomenon is structure-independent, 


indicating that it may have an electronic origin. The electrochemical 

data in a-f were collected at C/5 but the phenomenon is also observed 

at other rates. g, Diffraction patterns of NbjsW5Os5s before lithiation 

and after the first charge from operando measurements at C/2. h, Lattice 
parameters of the pristine and charged structure show changes that 
indicate that some lithium was retained in the structure after charging 
the electrode, commensurate with the changes from first- to second-cycle 
electrochemistry. 
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Extended Data Fig. 9 | Prospective lithium positions and pathways 
in block-type and bronze-type ternary niobium tungsten oxides and 
electrochemical comparisons to binary niobium oxides. a, b, Bond 
valence sum maps of NbjsW50s5 (a) and NbjgW16Oo3 (b) depict stable 
lithium positions and pathways according to bond valence energy 
landscape calculations performed in 3DBVSMAPPER™". Calculations 
were performed over a fine grid with 149 x 20 x 116 points computed 
for NbijsWsOss and 61 x 184 x 20 points computed for NbigW 16093 
along their respective crystallographic axes. Isosurface levels are 
shown at ‘2.0 eV’, which is a parameter used to visualize ionic pathways 
and not a quantitative estimation. The bond valence sum and bond 
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valence energy landscape provide an indication of lithium positions 

and diffusion pathways in complex or novel systems and have shown 
good agreement with experimental and computational investigations of 
structure and dynamics'?!~”3, c, Possible intrablock lithium positions 

for Nbi}sWsOss based on Li,MO; in the low-lithium regime, before it 
undergoes intercalant-induced distortion. d, e, Block phases NbigWsOss 
and H-Nb2Os are compared on the basis of Lit/TM (d) and gravimetric 
capacity (e) on the third cycle at C/5. f, g, Bronze-like phases NbjgW 16093 
and T-Nb2O; are compared on the basis of Lit/TM (f) and gravimetric 
capacity (g) on the third cycle at C/5. 
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Extended Data Table 1 | Lithium self-diffusion coefficients of lithium-ion battery electrode materials, solid electrolytes, liquid electrolytes 


and reference compounds 


Compound Structure Type Dui (m?-s~1) T (K) Technique Reference 
Lie.sNb16WsOss Block, cs 2.1«10-12 / 1.7%10-"9 298 PFG NMR (extrapolated) This work 
Lis sNbisWsOss Block, cs 2.7x10-11 / 5.2%10-19 413 PFG NMR This work 
Lis 4Nb1sWsOss Block, cs 1.6x10-"9 298 PFG NMR (extrapolated) This work 
Lis.4NbieWsOss Block, cs 6.5x10-13 403 PFG NMR This work 
Lis.4Nb1sW16003 Bronze-like 1.1%10°8 298 PFG NMR (extrapolated) This work 
Lis.4Nb1sW16093 Bronze-like 2.0x10-12 413 PFG NMR This work 
Lie sNbisW16Oe3 Bronze-like 1.1%10°779 298 PFG NMR (extrapolated) This work 
Lig. sNb1sW160¢3 Bronze-like 1.8x10-12 413 PFG NMR This work 
Lito.2Nb1sW16Oo3 Bronze-like 1.1x10°8 298 PFG NMR (extrapolated) This work 
Lito.2Nb1sW16093 Bronze-like 2.710712 413 PFG NMR This work 
LisoGeP2S12 Thio-LISICON 2x10-12 298 PFG NMR pe 
LitoGeP2S12 Thio-LISICON 4x10 453 PFG NMR ee 
Li7GePSs Thio-LISICON 2x10-12 298 PFG NMR ee 
Li7GePSs Thio-LISICON 4x10-"1 453 PFG NMR 82 
LisoSnP2S12 Thio-LISICON 1.410712 298 PFG NMR 95,96 
LitoSnP2S12 Thio-LISICON 4x10-"1 453 PFG NMR - 
Lit1SizPS12 Thio-LISICON 3.5x10-12 298 PFG NMR aS 
Lis1Si2zPS12 Thio-LISICON 4x10 453 PFG NMR 95 
Li7P3S11 Thio-LISICON 1-5x10-'2 303 PFG NMR 7 
Li7P3S11 Thio-LISICON 2-11%10-12 353 PFG NMR af 
[-LisPS4 Thio-LISICON 5.4«10-18 373 PFG NMR 28 
amorphous-LisPS4 Amorphous (Thio-LISICON) —_6.5x10-'9 303 PFG NMR 92 
amorphous-LisPS4 Amorphous (Thio-LISICON) 1.6-3.4x 10-12 353 PFG NMR a 

Lio 6[Lio2Sno.sS2] Layered (01) 2-20x10-12 298 PFG NMR Teo 

Lio 6[Lio 2Sno.sS2] Layered (O01) 2.6x10-10 407 PFG NMR 708) 
Lis.4Vo.6Sio. 404 LISICON 5x10-16 333 tracer 191 

0.25 M LiCl (aq.) Solution 9.6x10-1° 298 PFG NMR ate 

1.0 M LiPFe in PC Liquid electrolyte 8.3x10-1" 298 PFG NMR 108 

1.0 MLiPFe in EC Liquid electrolyte 9.9x10-11 298 PFG NMR 108 

1.0 M LiPFe in DEC Liquid electrolyte 2.5x10-10 298 PFG NMR 103 

Li B-alumina (ab plane) B-alumina 4.2x10-"" 298 PFG NMR ‘104 

Li B-alumina (c-axis) -alumina 6.4«10-13 333 PFG NMR 104 

LisN Li3N 1.410710 534 PFG NMR a 
Lit.sAlosGe1.s(PO4)3 NASICON 2.9x10-18 311 PFG NMR 108 
Lit.2Alo2Tit.8(POa)3 NASICON 1.5x10-12 250 NMR relaxometry + PFG NMR ane 
Lie.sLasZr16Tao4O12 Garnet 3.5x10-13 353 PFG NMR AOE 
Li7LasZr2O12 Garnet 1.8x10-18 298 NMR relaxometry 106 
LivLasZr2O12 Garnet 1.3x10-17 325 NMR relaxometry 108 
Li7LasZr2O12 Garnet 3.3x10-14 530 NMR relaxometry 108 
Graphite (Stage |) Graphite 1-2x10-15 298 NMR relaxometry _ 

Li metal bec 5x10-15 298 PFG NMR (extrapolated) 2 

Li metal bec 1x10-12 400 PFG NMR pau 
LisTisO12 Spinel 3.2«10-15 298 u'-SR 11,112 
Lis.7TisO12 Spinel 2.7x10-16 298 NMR relaxometry Ang 
LiTizO4 Spinel 3.6x10-15 298 u*-SR a 
f-LizTiO3 LizSnO3 2x10-17 433 NMR relaxometry oe 
LixTiO2 (x = 0.12) micro Anatase component 4.7x10-16 293 NMR relaxometry ae 
LixTiO2 (x = 0.12) micro Li-titanate component 1.3x10-15 293 NMR relaxometry _ 
LixTiO2 (x = 0.06) nano Anatase component 1.9x10-76 293 NMR relaxometry ae, 
LixTiO2 (x = 0.06) nano Li-titanate component 5.7x10-16 293 NMR relaxometry We 
LixTiO2 (x = 0.12) nano Anatase component 1.1x10-"6 293 NMR relaxometry _ 
LixTiOz (x = 0.12) nano Li-titanate component 1.8x10-76 293 NMR relaxometry ne 
LixTiOz (x = 0.12) nano Li-titanate component 4.9x10-16 413 NMR relaxometry 46 
Lio.ssTiOz nano Li-titanate 1.7x10-"6 293 NMR relaxometry HAG 
Lio.ssTiOz nano Li-titanate 4.6x10-16 413 NMR relaxometry ae 
LiFePO, Olivine 1-6x 10-20 300 DFT/Tracer 4950 
LiMn2O4 Spinel 1x10-20 350 NMR relaxometry aye 
LiMn2O4 Spinel 1.8x10-16 623 Tracer "me 
LiNbO3 LiNbO3 7.5x10-15 890 NMR relaxometry we 
LisNbO4 LisNbO4 4x10! 353 NMR relaxometry _ 
LisNbO4 LisNbO4 1x10-16 553 NMR relaxometry ae 


Battery electrode materials are not typically amenable to the direct measurement of lithium diffusion via PFG NMR owing to rapid paramagnetic (7; and Tz) relaxation and slow diffusion. The weak 


paramagnetism (as indicated by the s. 
elevated temperature, where the 79 re! 


mall NMR shift) and ultrafast diffusion of block-phase NbigWsOss and bronze-like-phase Nb1gW16093 enabled PFG NMR measurement, but only at slightly 
axation time was increased. Room-temperature Dj); values were calculated by extrapolating from the elevated temperature data and assuming Arrhenius 


behaviour. Lithium diffusion measurements from NMR relaxometry are included here for several anode materials in which diffusion is too slow to be measured by PFG NMR®%. cs, crystallographic 
shear; LISICON, lithium superionic conductor; NASICON, sodium superionic conductor; O1, layered structure with octahedral lithium and single-layer repeat stacking of transition-metal layers; 


PC, propylene carbonate; bcc, body-centred cubic; ,.t-SR, muon spin resonance; DFT, density functional theory. 
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Evolution of a central neural circuit 
underlies Drosophila mate preferences 


Laura F. Seeholzer!, Max Seppo!, David L. Stern? & Vanessa Ruta!* 


Courtship rituals serve to reinforce reproductive barriers between closely related species. Drosophila melanogaster 
and Drosophila simulans exhibit reproductive isolation, owing in part to the fact that D. melanogaster females produce 
7,11-heptacosadiene, a pheromone that promotes courtship in D. melanogaster males but suppresses courtship in 
D. simulans males. Here we compare pheromone-processing pathways in D. melanogaster and D. simulans males to 
define how these sister species endow 7,11-heptacosadiene with the opposite behavioural valence to underlie species 
discrimination. We show that males of both species detect 7,11-heptacosadiene using homologous peripheral sensory 
neurons, but this signal is differentially propagated to P1 neurons, which control courtship behaviour. A change in the 
balance of excitation and inhibition onto courtship-promoting neurons transforms an excitatory pheromonal cue in 
D. melanogaster into an inhibitory cue in D. simulans. Our results reveal how species-specific pheromone responses can 
emerge from conservation of peripheral detection mechanisms and diversification of central circuitry, and demonstrate 
how flexible nodes in neural circuits can contribute to behavioural evolution. 


Animals display an extraordinary diversity of behaviour both within 
and between species. Although there is increasing insight into how 
learning and experience modify neural processing to produce varia- 
tions in individual behaviour, far less is known about how evolution 
shapes neural circuitry to generate species-specific responses. Cross- 
species comparative studies have identified genetic loci that explain 
behavioural diversity’, but only rarely have they examined the neural 
substrate upon which this genetic variation acts. Consequently, how 
the nervous system evolves to give rise to species-specific behaviours 
remains unclear. 

As species diverge, their reproductive isolation is often reinforced by 
the development of behavioural differences that signify species identity 
and discourage interspecies courtship. The rapid evolution of courtship 
rituals therefore provides an entry point to examine the neural mech- 
anisms that underlie behavioural divergence between closely related 
species. For example, D. melanogaster and D. simulans diverged 2-3 
million years ago” (Fig. 1a) and, although these cosmopolitan species 
frequently encounter each other in the environment, they remain 
reproductively isolated and rarely mate. When copulation does occur, 
it results in inviable or sterile offspring. The avoidance of interspecies 
mating therefore benefits both species. Indeed, both D. melanogaster 
and D. simulans males are choosy in selecting mates and, when offered 
a choice, preferentially court conspecific females (Fig. 1b). 

One mechanism for selective courtship is the use of sex- and 
species-specific pheromones that either promote the courtship of 
conspecific females or suppress the pursuit of inappropriate mates®. 
Pheromone production has rapidly diversified across drosophilids’, 
such that D. melanogaster females predominantly produce 7,11- 
heptacosadiene (7,11-HD) on their cuticle whereas D. simulans females 
produce 7-tricosene (7-T), as do D. simulans and D. melanogaster 
males®. This pheromonal difference seems to be sufficient for species 
discrimination: perfuming D. simulans females with 7,11-HD renders 
them attractive to D. melanogaster males but unattractive to D. simulans 
males®. 7,11-HD is therefore detected by males of both species, but 
plays an opposing role in controlling their courtship decisions. 


Species-specific pheromone responses could arise from the evolu- 
tion of peripheral detection mechanisms or the central circuits that 
process pheromone signals to regulate courtship. To differentiate 
between these possibilities, we performed a direct comparison of the 
homologous pheromone pathways in D. simulans and D. melano- 
gaster males. Here we demonstrate that species-specific responses to 
7,11-HD emerge from the reweighting of excitatory and inhibitory 
inputs at a central node in the courtship circuit, highlighting how func- 
tional adaptations of central sensory processing pathways can lead to 
divergent behaviours. 


Ppk23 mediates 7,11-HD detection 

A critical step in Drosophila mate assessment occurs when a male 
taps the abdomen of another fly with his foreleg to taste their cuticular 
pheromones”. D. melanogaster and D. simulans males whose foreleg 
tarsi have been surgically removed still court vigorously (Extended 
Data Fig. la—c), but do so promiscuously (Fig. 1b). Therefore, although 
the detection of cuticular pheromones is not necessary for robust 
courtship’, it is essential for species discrimination. 

Several classes of gustatory sensory neurons on the D. melanogaster 
male foreleg detect pheromones to differentially regulate courtship. 
One heterogeneous sensory population expresses the Ppk23 DEG/ 
ENaC channel: a subset of Ppk23* neurons detects female pheromones, 
including 7,11-HD, to promote courtship, and another subset detects 
male pheromones to inhibit courtship!!"!%. A smaller population of 
foreleg sensory neurons expresses the Gr32a receptor and detects 7-T 
to suppress inappropriate pursuit of D. simulans females and D. mela- 
nogaster males!®"*, Although it is not known whether Gr32a and Ppk23 
directly bind cuticular hydrocarbons, they serve as essential components 
of these pheromone transduction pathways!°"4. We therefore inves- 
tigated how Gr32a and Ppk23 shape mate preferences in D. simulans 
by using CRISPR-Cas9 genome editing to generate mutant alleles 
(Extended Data Fig. 1d). Both Gr32a and ppk23 mutant males pur- 
sued conspecific females with the same vigour as wild-type D. simulans 
males (Fig. 1c), suggesting that either these receptors do not contribute 
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Fig. 1 | Pheromone regulation of D. simulans courtship. a, Predominant 
cuticular hydrocarbons of related species. b, Courtship preferences of 

D. melanogaster (mel) and D. simulans (sim) males with foreleg tarsi intact 
(+) or removed (—). ¢, d, Courtship indices (c) and preference indices (d) 
of wild-type (WT), Gr32a~/~ and ppk23~/~ D. simulans males offered 

D. simulans and/or D. melanogaster females. e, Preference indices of 
wild-type or ppk23~/~ D. simulans males offered D. simulans females 
perfumed with ethanol (EtOH) or 7,11-HD. Statistical tests used were a 
one-sample t-test (b, d, e) and a Kruskal-Wallis test (c). **P < 0.01, 

***P < 0.001, NS, not significant (b, d, e); different letters mark 
significant differences in courtship by Dunn’s multiple comparison 

test (c). Data are mean and s.d., with individual data points shown. See 
Supplementary Table 1 for details of statistical analyses. 


to the detection of 7-T in D. simulans, or that 7-T does not play a role in 
promoting male courtship’®, despite being the predominant cuticular 
pheromone on D. simulans females®. D. simulans Gr32a mutant males 
also did not court D. melanogaster females or females of more distant 
species (Fig. 1c, d, Extended Data Fig. le, f). Therefore, contrary to its 
role in mediating courtship suppression in D. melanogaster'®'*, Gr32a 
does not seem to influence mate choices in D. simulans. 

By contrast, ppk23 mutants pursued D. melanogaster females and 
other drosophilids carrying diene pheromones with the same inten- 
sity as they courted D. simulans females (Fig. 1c, Extended Data 
Fig. le), and were unable to differentiate between D. melanogaster and 
D. simulans females in preference assays (Fig. 1d, Extended Data 
Fig. 1f). Moreover, ppk23 mutants were not deterred from courting 
D. simulans females that were perfumed with 7,11-HD (Fig. le, Extended 
Data Fig. 1g), suggesting that their promiscuous courtship reflects an 
inability to detect the D. melanogaster pheromone. As such, males of 
both species rely on ppk23 to detect 7,11-HD, but detection of this pher- 
omone initiates opposing behaviours in the two species—promoting 
courtship in D. melanogaster while suppressing courtship in D. simulans. 
We therefore developed genetic tools in D. simulans to examine the 
sensory neurons in which Ppk23 is expressed and the downstream 
circuits that they activate to identify the neural adaptations that 
contribute to species-specific pheromone responses. 


fruitless specifies courtship circuitry 

In D. melanogaster, the male-specific isoform of the Fruitless transcrip- 
tion factor (Fru)!®!7 marks the majority of Ppk23* sensory neurons 
in the male foreleg!'~!? along with the neural circuitry mediating 
most of the components of male courtship, from sensory detection to 
motor implementation. To gain genetic access to the repertoire of Fru? 
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Fig. 2 | Fruitless has a conserved role in male courtship. a, Fru+ neurons 
(green) with neuropil counterstain (magenta) in brains of D. melanogaster 
and D. simulans males. Scale bar, 30 jum. b, Courtship preference indices of 
wild-type (WT) and fru~’~ D. simulans males. c, Fraction of time that 
solitary males displayed courtship behaviours with (right) or without (left) 
optogenetic stimulation of Fru* neurons. Statistical tests used were a 
one-sample t-test (b) and a Wilcoxon matched-pairs signed-rank test (c). 
*P < 0.05, ***P < 0.001. Data are mean and s.d., with individual data 
points shown. See Supplementary Table 1 for details of statistical analyses. 


neurons in D. simulans, including those that detect and process 7,11- 
HD, we integrated either the GFP or Gal4 coding sequence into the 
first intron of the fru locus (Extended Data Fig. 2a). We observed that, 
in both species, fru marks a similar ensemble of neurons distributed 
throughout the male nervous system, with comparable innervation 
patterns evident in most brain neuropils (Fig. 2a, Extended Data 
Fig. 2b-g). 

Given the anatomic similarity of Fru* neurons, we wanted to 
confirm the evolutionary conservation of fru as a master regulator of 
male courtship behaviours!*°. We found that, as in D. melanogaster, 
mutation of fru™ resulted in D. simulans males that exhibited aberrant 
mate preferences, whereas the optogenetic activation of Fru* neurons 
triggered multiple components of the courtship ritual in an isolated 
male (Fig. 2b, c, Extended Data Fig. 2h, i). Therefore, in both species, 
fru marks circuits that specify male courtship towards appropriate 
sexual partners, providing an inroad to trace and compare the neural 
pathways that underlie mate discrimination, from the sensory periphery 
to higher brain centres. 


Conserved peripheral sensory responses 

To investigate the pheromone tuning of the Ppk23* sensory popula- 
tion in D. melanogaster and D. simulans males, we expressed the Ca2+ 
indicator GCaMP6s in Fru* neurons and monitored the aggregate 
responses of foreleg sensory afferents in the ventral nerve cord of a 
male as his tarsus contacted the abdomen of a female (Fig. 3a). We 
found that the Fru* sensory neurons of both D. melanogaster and 
D. simulans males exhibited comparable pheromone tuning, responding 
robustly to the taste of a D. melanogaster female but weakly to that 
of a D. simulans female (Fig. 3b, c). In males of both species, sensory 
neuron responses were strongly attenuated in ppk23 mutants (Fig. 3b, c, 
Extended Data Fig. 3e), verifying that Ppk23 plays a conserved and 
essential role in pheromone detection. 

To compare the distribution of 7,11-HD-responsive sensory neurons 
across species directly, we generated a ppk23-Gal4 construct in 
D. simulans, taking advantage of the fact that, in D. melanogaster, the 
ppk23 promoter faithfully reproduces endogenous channel expression’. 
We found that the ppk23 promoter from either species drove expres- 
sion in a comparable number of sensory neurons in the male foreleg, 
whose axons exhibited a similar sexually dimorphic projection pattern 
within the ventral nerve cord (Extended Data Fig. 3a—d). Imaging the 
aggregate activity of these Ppk23* sensory afferents revealed equiva- 
lent pheromone tuning across males of both species, with significantly 
stronger responses to the taste of a D. melanogaster female than to that 
of a D. simulans female (Fig. 3d, Extended Data Fig. 3e). Moreover, 
we observed no difference in the distribution or magnitude of func- 
tional responses in Ppk23* soma when we stimulated individual 
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Fig. 3 | Conserved pheromonal tuning of Ppk23* and Fru* foreleg 
sensory neurons. a, Schematic of ventral nerve cord preparation 

used for imaging (top) and Fru” foreleg sensory neurons expressing 
GCaMP (bottom). Scale bar, 10 um. b, c, Functional responses of Fru* 
foreleg afferents in D. melanogaster (b) or D. simulans (c) males evoked 

by the taste of a D. melanogaster or D. simulans female. Representative 
activity traces with time of taps indicated by coloured tick marks (left) and 
average normalized fluorescence (AF/F) in wild-type (WT) and ppk23~/— 
males (right). d, Functional responses (average AF/F) of Ppk23* foreleg 


sensory bristles on the foreleg with synthetic 7,11-HD (Extended Data 
Fig. 3f). As in D. melanogaster'*!, Ppk23* soma were paired beneath a 
sensory bristle, and only one soma of each pair responded to 7,11-HD. 
Somatic responses to synthetic 7-T were negligible in both species 
(data not shown), mirroring the weak responses evoked at the popula- 
tion level by the taste of a D. simulans female (Fig. 3d). 

Together, these experiments demonstrate that a quantitatively and 
qualitatively similar population of Ppk23* sensory neurons is tuned 
to 7,11-HD in both D. melanogaster and D. simulans males. To assess 
whether the activation of Ppk23* sensory neurons could replicate the 
opposing courtship behaviours elicited by 7,11-HD, we expressed 
CsChrimson in this sensory population in males of both species and 
examined how optogenetic activation influenced courtship of a con- 
specific female. Whereas optogenetic stimulation of Ppk23* sensory 
neurons in D. melanogaster males drove increased courtship”? , it 
inhibited courtship towards an otherwise attractive conspecific female 
in D. simulans males (Fig. 3e), replicating the courtship suppression that 
results from perfuming a D. simulans female with 7,11-HD. Therefore, 
activation of the homologous Ppk23* sensory neuron population is suf- 
ficient to drive opposing behavioural responses in D. melanogaster and 
D. simulans males, suggesting that differences must exist in the down- 
stream circuits that link pheromone detection to courtship decisions. 


Divergent responses in central circuits 
In D. melanogaster, male-specific P1 neurons form a central node in 
the Fru* circuitry that integrates input from multisensory pathways to 
represent the suitability of a potential mate and triggers the initiation 
of courtship?!-”°, Anatomic labelling revealed that, in both species, P1 
neurons exhibit rich projections in the lateral protocerebral complex 
(LPC, Fig. 4a), a sexually dimorphic Fru* neuropil that receives input 
from multiple sensory processing pathways and outputs to descending 
neurons that drive the component behaviours of male courtship*’?>””. 
To test whether P1 neurons play a conserved role in regulating court- 
ship across species, we introduced a transcriptional enhancer (R71G01- 
Gal4) that labels this neural population in D. melanogaster’? into the 
D. simulans genome and used it to drive expression of CsChrimson 
(Extended Data Fig. 4a). Optogenetic activation of P1 neurons in 
D. simulans males strongly enhanced courtship of conspecific females 
and drove almost incessant courtship of inappropriate targets, includ- 
ing D. melanogaster females and a small rotating magnet (Fig. 4b, 
Supplementary Video 1). Pursuit of the magnet was significantly 
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afferents evoked by the taste of a D. melanogaster or D. simulans female. 
e, Courtship indices towards conspecific females during optogenetic 
stimulation of Ppk23* neurons in D. melanogaster (top) and D. simulans 
males (bottom). Statistical tests used were a Wilcoxon matched-pairs 
signed-rank test (b-d) and a Kruskal-Wallis test (e). *P < 0.05, 

**P < 0.01, ***P < 0.001 (b-d); different letters mark significant 
differences in courtship indices by Dunn’s multiple comparison test (e). 
Data are mean and s.d., with individual data points shown. See 
Supplementary Table 1 for details of statistical analyses. 


reduced when it was stationary or moving slowly, highlighting the 
importance of motion for vigorous courtship”*.”° (Extended Data 
Fig. 4c). Courtship towards all targets remained increased after stimu- 
lation, indicating that transient activation of P1 neurons is sufficient to 
trigger an enduring state of sexual arousal across species***8 (Fig. 4b, 
Extended Data Fig. 4b-f). 

Notably, titration of the stimulating light revealed that evoked court- 
ship in D. simulans males was significantly weaker towards D. melano- 
gaster females than D. simulans females (Fig. 4c), raising the possibility 
that 7,11-HD may suppress courtship by countering the excitation of P1 
neurons. To compare how pheromone signals are propagated from the 
periphery to P1 neurons and other central Fru* populations, we mon- 
itored responses either in the LPC or in the P1 neurons in a tethered 
male as he tapped the abdomen of a target fly with his foreleg (Fig. 4d, 
Extended Data Fig. 4g). In D. melanogaster males, robust responses were 
evoked in the LPC by the taste of a D. melanogaster female, but not a 
D. simulans female, reflecting strong excitation of P1 neurons by the 
pheromones of an appropriate mate”? (Fig. 4e, g, Extended Data Fig. 4h, 
Supplementary Videos 2, 3). By contrast, in D. simulans males, nei- 
ther the P1 neurons nor any other Fru* neural populations in the LPC 
were activated in response to the taste of a D. simulans female (Fig. 4f, 
Supplementary Video 4), which is consistent with behavioural evidence 
that contact pheromones are not necessary to promote D. simulans 
courtship®!> . Neurons in the LPC of D. simulans males were, however, 
weakly activated by the taste of a D. melanogaster female, but these 
signals failed to propagate to the P1 neurons (Fig. 4f, h, Supplementary 
Video 5). Responses were not apparent in ppk23 mutants, verifying that 
pheromone signalling to the LPC relies on Ppk23* sensory pathways 
(Extended Data Fig. 4i, j). Opposing behavioural responses to 7,11-HD 
in the two species therefore appear to be mirrored by divergent P1 
neuron excitation. 


Species-specific central circuit changes 

In D. melanogaster and D. simulans males, equivalent pheromone 
responses at the sensory periphery are translated into differential 
excitation of P1 neurons, suggesting that the ascending pathways 
that convey pheromone signals from the forelegs to the LPC have 
diverged. In D. melanogaster, 7,11-HD signals are transmitted from 
Ppk23* sensory neurons to vAB3 neurons, whose dendrites reside in 
the ventral nerve cord and whose axons project to the LPC, providing 
a direct excitatory route to the P1 neurons”® (Fig. 5a). vAB3 neurons 
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Fig. 4 | Divergent pheromone responses in courtship-promoting 

P1 neurons. a, P1 neurons (grey) innervate the Fru* (green) LPC. 

b, c, Optogenetic stimulation of P1 neurons in D. simulans males. 

b, Courtship towards a rotating magnet (top), D. simulans female 
(middle), or D. melanogaster female (bottom). Fraction of flies courting 
(left; grey boxes indicate illumination with bright light) and courtship 
indices pre-, during, and post-stimulation (right). c, Courtship indices 

of D. simulans males towards D. simulans (top) and D. melanogaster 
(bottom) females in dim or bright white light or the indicated illumination 
intensity at 627 nm. d, Experimental set-up for in vivo functional 
GCaMP imaging (top) and schematic of Fru* neurons (bottom; the 
regions imaged are outlined). e, f, Functional responses of Fru* neurons 
in the LPC of D. melanogaster (e) and D. simulans (f) males evoked by 
tapping D. melanogaster and D. simulans females. Representative GCaMP 


also extend collaterals into the suboesophageal zone (SEZ), where 
they synapse onto GABAergic (aminobutyric acid-releasing) mAL 
interneurons”*. mAL axons arborize extensively in the LPC and provide 
inhibitory input onto P1 neurons, forming a feedforward inhibitory 
circuit motif that tempers P1 neuron excitation and regulates the gain 
of pheromone responses”!”’. In D. melanogaster, P1 neurons thus 
receive excitatory and inhibitory input even in response to the taste of 
a conspecific female, with 7,11-HD evoking net excitation to trigger 
courtship. Anatomic labelling of D. simulans vVAB3 and mAL neurons 
revealed broadly conserved projection patterns within the LPC (Fig. 5b, 
Extended Data Fig. 5), indicating that they remain anatomically poised 
to synapse onto P1 neurons. 

Given the structural conservation of these ascending pathways, we 
considered whether there might be functional differences in how pher- 
omone signals are transmitted through this circuit to generate divergent 
P1 neuron responses. Functional imaging revealed that vAB3 neurons 
were similarly tuned to pheromones across species, with robust 
responses elicited only by the taste of a D. melanogaster female, and 
not a D. simulans female (Fig. 5c, Extended Data Fig. 6a—d). Moreover, 
in both species, vAB3 pheromone responses were lost in ppk23 mutants 
(Extended Data Fig. 6b-d). Therefore, although Ppk23* sensory 
neurons drive opposing courtship behaviours in D. melanogaster and 
D. simulans males, they nevertheless elicit comparable pheromone 
responses in the homologous ascending pathways in the brain. 

To compare the role of mAL neurons across species directly, we intro- 
duced a genetic driver (R25E04-Gal4) that labels mAL neurons” into 
D. simulans (Extended Data Fig. 6e). Optogenetic activation of mAL 
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shown. Scale bars for anatomic images, 10|1m. See Supplementary Table 1 
for details of statistical analyses. 


neurons in D. simulans males strongly attenuated courtship (Extended 
Data Fig. 6f), replicating the robust courtship suppression mediated by 
mAL neurons in D. melanogaster” and confirming that this population 
inhibits neurons in the LPC in both species. Functional imaging 
revealed that mAL neurons were equivalently excited by the taste of a 
D. melanogaster female in both D. melanogaster and D. simulans males 
(Fig. 5d, Extended Data Fig. 6g). Therefore, in both species vAB3 and 
mAL pathways are similarly activated by D. melanogaster female pher- 
omones, suggesting that alterations in the strength of their signalling to 
P1 neurons might underlie the emergence of species-specific mate pref- 
erences, such that 7,11-HD can evoke excitation of P1 neurons only in 
D. melanogaster males to initiate courtship of a D. melanogaster female. 

To examine this possibility, we directly stimulated vVAB3 neurons 
through iontophoresis of acetylcholine onto their dendrites within the 
ventral nerve cord and performed multi-plane functional imaging to 
visualize activated Frut neurons in the brain (Fig. 5e, Extended Data 
Fig. 7a, b). Although this stimulation elicited equivalent responses in the 
vAB3 and mAL neurons of both species, P1 neurons and other neurons 
of the LPC were excited only in D. melanogaster and not in D. simulans 
males (Fig. 5e, f, Extended Data Fig. 7d-g), mirroring the differential 
propagation of pheromone signals through this pathway in vivo. In both 
species, the responses of all Fru* neurons were lost after we severed 
the vAB3 axons with a two-photon laser, verifying that vAB3 neurons 
mediate these divergent activity patterns (Extended Data Fig. 7c). 

To assess whether mAL-mediated inhibition could counter vVAB3 
excitation to suppress P1 neurons in D. simulans, we used a two-photon 
laser to sever the mAL axonal tract and prevent the transmission 
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*P < 0.05; **P < 0.01, ***P < 0.001. Data are mean and s.d., with 
individual data points shown. See Supplementary Table 1 for details of 
statistical analyses. 


of their GABAergic signal to the LPC. After the severing of mAL, 
we found that P1 neurons could be excited by vAB3 stimulation in 
D. simulans males, although to a lower level than in D. melanogaster 
males (Fig. 5f, Extended Data Fig. 7g). Pharmacological weakening of 
inhibition by injection of the GABA-receptor antagonist, picrotoxin, 
into the LPC similarly enhanced pheromone responses in vivo, 
unmasking excitation specifically in response to the taste of a 
D. melanogaster female (Extended Data Fig. 6h). These results suggest 
that mAL-mediated inhibition antagonizes vAB3 excitation to fully 
suppress P1 neuron responses in D. simulans males, but not in D. melano- 
gaster males, revealing how alterations in excitatory and inhibitory input 
to this population may generate divergent responses to the same 
pheromone cue. 


Discussion 

The sensory periphery has been proposed to be the most evolutionarily 
labile element of the nervous system*”*”, as changes in the expres- 
sion or tuning of sensory receptors can allow for the emergence of 
species-specific behaviours without necessitating potentially more 
complex developmental rewiring of central pathways in the brain. By 
contrast, our results suggest that species-specific behavioural responses 
to 7,11-HD arise through functional alterations in how pheromone 
signals are propagated through a structurally conserved central 
circuit that consists of parallel excitatory and feedforward inhibitory 
branches. By reweighting the balance of excitatory vAB3 and inhibitory 
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mAL signalling to P1 neurons, 7,11-HD is transformed from an excit- 
atory signal that promotes courtship in D. melanogaster males into 
an inhibitory signal that suppresses courtship in D. simulans males. 
Although our analysis highlights the importance of this pheromone 
pathway in shaping species-specific mate preferences, we cannot 
exclude the possibility that additional inputs to P1 neurons or other tar- 
gets of mAL and vAB3 neurons also contribute to divergent courtship 
decisions. Nevertheless, our data suggest that the branched architec- 
ture of pheromone-processing pathways serves as a substrate for the 
evolution of mate preferences, pointing to the potential existence of 
favourable sites within neural circuits to instantiate adaptive behav- 
ioural changes, analogous to how specific nodes within developmental 
regulatory networks contribute to morphological diversity*®. 

The conserved tuning of Ppk23* neurons suggests that D. simulans 
males dedicate this sensory pathway to the detection of D. melanogaster 
female pheromones, rather than sensing the chemical cues carried by 
conspecific females. Consistent with this idea, although P1 neurons 
are sufficient to elicit courtship in D. simulans males, they are not 
excited by the taste of a D. simulans female. Given that D. simulans 
cuticular pheromones are sexually monomorphic and offer ambigu- 
ous signals for mate recognition, males probably rely on additional 
sensory inputs for their arousal. Indeed, the fervent courtship exhibited by 
D. simulans ppk23 mutants towards females of different species demon- 
strates that D. simulans males can be aroused in the absence of any species- 
specific excitatory cue. Dienes, such as 7,11-HD, actively suppress this 
arousal, presumably through recruitment of strong mAL-mediated 
inhibition via Ppk23* pathways. Together, these observations rein- 
force the notion that pheromone communication in Drosophila serves 
to focus a male’s desire, such that flies lacking cuticular pheromones 
can be inherently attractive and appropriate mate choices are honed by 
specific inhibitory chemical cues®!®"4, 

Peripheral adaptations are likely to play an important role in the 
evolution of novel chemical sensitivities***?. In this case, however, 
preserving the sensory periphery while varying central circuits provides 
a mechanism to alter the behavioural valence of a single chemical cue. 
As D. melanogaster and D. simulans diverged, their reproductive isola- 
tion was probably strengthened by the ability of both species to detect 
the same pheromone but assign it a different meaning through these 
central circuit modifications. Notably, in D. melanogaster, P1 neuron 
excitability is regulated by the social history of a male****. This suggests 
that both experience-dependent and evolutionary adaptations may act 
on the same neural substrate to modify sensory integration and mate 
choices, similar to how phenotypic plasticity may facilitate morpho- 
logical evolution*®. Therefore, functional reweighting of sensory inputs 
at flexible nodes in the nervous system, shaped by evolutionary selec- 
tion or individual experience, may allow for alternative behavioural 
responses to the same sensory signal. 


Online content 

Any Methods, including any statements of data availability and Nature Research 
reporting summaries, along with any additional references and Source Data files, 
are available in the online version of the paper at https://doi.org/10.1038/s41586- 
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METHODS 


Data reporting. Preliminary experiments were used to assess variance and deter- 
mine adequate sample sizes in advance of conducting the experiment. We used 
random.org/sequences to randomize the order of all behaviour experiments. To 
control for potential variations in experimental conditions across days, we collected 
a similar sample size for each variable, every day the experiment was conducted. 
For functional experiments, we interleaved genotypes and female stimuli when 
applicable. All behaviour experiments were conducted with the experimenter 
blinded to the genotype or condition of any male or female fly that was a variable 
in a given experiment. The experimenter was unblinded only after analysis of 
the assay. The experimenter was not blinded to the genotype of males used in 
functional imaging assays. 

Fly stocks and husbandry. Flies were housed under standard conditions at 25°C 
under a 12h light:12h dark cycle. Fly stocks and sources were as follows: Drosophila 
melanogaster Canton-S, 20xUAS-IVS-GCaMP6s (Bloomington; 42746, 42749), 
UAS-mCD8::GFP (5130, 5137), LexAop-GCaMP6s (53747), 10xUAS-IVS-myr::t- 
dTomato (32222), R71G01-Gal4 (39599), AbdB-Gal4 (55848) R25E04-Gal4 
(49125), and 20xUAS-IVS-CsChrimson.m Venus (55134) were obtained from the 
Bloomington Stock Center. D. sechellia (14021-0248.25) and D. erecta (14021- 
0224.01) were obtained from the UCSD Stock Center. The following were 
obtained as indicated: D. ananassae and D. simulans (Ruta laboratory), D. simu- 
lans attP2039°° (Y. Ding and D. Stern, Janelia Research Campus); SplitP1-Gal4”* 
(D. Anderson, Caltech); fru’*4?” and fruS**!6 (B. Dickson, HHMI/Janelia Farm 
Research Campus); D. melanogaster ppk23-Gal4" (K. Scott, UC Berkeley); LexAop- 
myr::tdTomato (C. Mendes, Columbia University); UAS-SPA-t2a-SPA-GFP* (Ruta 
laboratory). Supplementary Table 1 provides detailed descriptions of genotypes 
used in each experiment. 

Courtship behaviour assays and analysis. To standardize fly size and life his- 
tory across trials, all flies used for behavioural assays were reared in food vials 
at a low density (3 females and 3 males seeded into vial). All experimental flies 
were male and all stimulus flies were virgin females unless otherwise noted. Males 
for all assays were collected as virgins, placed in individual food vials (d=3 cm, 
h=9.5cm) and housed in isolation for 3-6 days. Males were added to behavioural 
assays by direct aspiration from the food vial without ice or CO; anaesthetization, 
except for the tarsi ablation experiments in which males were ice-anaesthetized. 
Virgin females were group-housed in food vials and aged 3-6 days. All behavioural 
assays were conducted at Zeitgeber 0 to 3h except for assays using flies reared in 
the dark. All behavioural assays were conducted in a heated, humidified room 
(25°C, 46% relative humidity) on a back-lit surface (Logan Electric Slim Edge- 
Light Pad A-5A, 5400K, 6kIx). For all preference assays, only males who spent 
more than 5% of the time courting (>30s of total courtship) were included in 
the analysis. Courtship behaviours included in the analysis were singing, tapping, 
licking, orienting, abdomen bending and chasing. QuickTime?7 software was used 
when analysing courtship videos. 

For all preference assays (Figs. 1b, 1d, le, 2b), a male and two female flies were 
placed into a 38-mm diameter, 3-mm height circular chamber with sloping walls 
(courtship arena)**. The experimenter, who was blinded to the genotype of the 
flies, kept track of the females during the assay either by noting which female 
was introduced first to the courtship arena or by painting a small white dot on 
the thorax of the female 16-20h before the start of the experiment under ice 
anaesthesia. Results were not affected by the method used to differentiate between 
females and the experimenter was unblinded only after analysis. The preference 
index is the amount of time the male spent courting one female subtracted from 
the amount of time spent courting the other female divided by the total time spent 
courting within a 10-min assay. The wide spread of the data reflects the fact that 
individual males will sometimes continue to pursue a single female throughout 
the assay even if both females are equivalent. In Extended Data Fig. 1 we plot 
the courtship indices (time spent courting/total time of assay) underlying the 
preference indices of Fig. 1. 

For the tarsi ablation assays (Fig. 1b, Extended Data Fig. la—c), males were 
ice-anaesthetized 16-20h before the start of the experiment and had either the 
distal three tarsal segments on both forelegs removed or a sham treatment that 
left their appendages intact. For single-choice assays (Extended Data Fig. 1b, c), 
the rear leg tarsi were ablated as a control. Males were then returned to a food 
vial to recover in isolation. Males without foreleg or rear leg tarsi still vigorously 
court females. 

In D. simulans single-pair courtship assays (Figs. 1c, 3e, 4b, 4c; Extended Data 
Figs. 1b, Ic, le, 4e, 4f, 6f), a single virgin female and a D. simulans male were loaded 
into a courtship arena. Courtship index (time spent courting/total time together) 
was measured for the 10 min after the male was introduced into the chamber. 

For the chaining assay (Extended Data Fig. 2i), eight males were loaded into a 
courtship arena and the chaining index (time in which at least three of the males 
were simultaneously courting each other divided by the total time of the assay) was 
measured for 10 min after the males were introduced into the chamber. 


For the preference assays with perfumed females (Fig. le), we provided a 
male the choice between a D. simulans virgin female perfumed with 7,11-HD 
(7(Z),11(Z)-heptacosadiene, 10 mg ml!, Cayman Chemicals, 100462-58-6) or 
ethanol using a previously published protocol®. In brief, 7,11-HD or ethanol was 
added to 1 ml of ethanol in a 2-ml glass vial (Thermo Scientific, 03-377D), placed 
on ice and dried using a stream of nitrogen gas. Seven ice-anaesthetized female 
flies were placed in each vial, and the vials were then gently vortexed three times 
for 30s before placing the females on food for an hour to recover. After perfuming, 
separate aspirators were used to handle the flies to avoid pheromone contamina- 
tion. Courtship indices underlying courtship preference in Fig. le were plotted in 
Extended Data Fig. 1g. 

For fruS** optogenetic stimulation experiments (Fig. 2c), fru 
UAS-CsChrimson.tdTomato or fru° parental controls were reared in the dark 
for 3-7 days after eclosion. Male flies were transferred to food containing 400 1M 
all-trans-retinal (Sigma, R2500-10MG) 16-20h before the assays”. Single male 
flies were loaded into a courtship arena and allowed to acclimate for 1 min. Flies 
were subsequently recorded for 7 min, alternating between 1 min of dim white 
light followed by 1 min with constant LED stimulation (530 nm Precision LED 
Spotlight with Uniform Ilumination—PLS-0530-030-S, Mightex Systems at an 
intensity of 0.02mW mm ”). Genotypes were established using PCR screening of 
the UAS transgene. We quantified a courtship behaviour index, which represented 
the fraction of time a male spent performing courtship behaviours. 

For ppk23-Gal4 (Fig. 3e) and R25E04-Gal4 (Extended Data Fig. 6f) optogenetic 
stimulation experiments, we used D. simulans w* R25E04-Gal4, ppk23-Gal4 and 
UAS-CsChrimson.tdTomato parental stocks and D. melanogaster w~ ppk23-Gal4 
and UAS-CsChrimson parental stocks lacking balancer chromosomes. The original 
D. simulans R25E04-Gal4, ppk23-Gal4 and UAS-CsChrimson.tdTomato parental 
stocks were in a background mutant for white (w~ ), which exhibited extremely low 
courtship indices (around 5% on average), presumably owing to their low visual 
acuity. By contrast, D. melanogaster transgenic lines maintained robust courtship 
even in a white mutant background (data not shown). Therefore, we backcrossed 
D. simulans stocks to wild-type flies to generate w* strains and confirmed their 
genotype by PCR. All crosses were reared in the dark. Virgin male progeny were 
reared in isolation in the dark for 3-7 days after eclosion and then transferred to 
food containing 400|.M all-trans-retinal 16-20h before the assays”*. We found 
that D. simulans courtship was less robust under single-wavelength LED illu- 
mination or dim white-light illumination, so we conducted our assays using the 
same lighting conditions used for the non-optogenetic courtship assays (Logan 
Electric Slim Edge-Light Pad A-5A, 5400K, 6klx). Single male flies were loaded 
into a courtship arena that contained a conspecific virgin female and the courtship 
index was calculated over a 10-min period after the male was introduced. Owing to 
the lack of balancers and visual markers in D. simulans, stable, homozygous stocks 
of most transgenes were difficult to generate. Therefore, progeny of crosses were a 
mix of wild-type, parental controls and experimental flies. The experimenter was 
blinded to the genotype of the flies until after the experiment. Genotypes were 
established using PCR amplification of the Gal4 and UAS transgenes. Males of 
all genotypes exhibited similar levels of locomotion when they were not courting 
(data not shown). 

For optogenetic stimulation of P1 neurons in D. simulans (Fig. 4b, c, Extended 
Data Fig. 4c-f), we used R71G01-Gal4>UAS-CsChrimson.m Venus males that car- 
ried a wild-type (w*) X chromosome. As in other behavioural experiments, P1 
neuron-elicited courtship pursuit was far weaker in males mutant for white (data 
not shown). For optogenetic stimulation of P1 neurons in D. melanogaster, w’; 
R71G01-Gal4>UAS-CsChrimson.m Venus males lacking balancer chromosomes 
were used. We observed a high degree of lethality in both the D. melanogaster and 
D. simulans R71G01-Gal4>UAS-CsChrimson crosses grown on standard fly food 
containing cornmeal (presumably owing to the low levels of retinal metabolized 
from vitamin A). Therefore, we grew these crosses on sugar-yeast food in the 
dark (per 11 of water: 100 g Brewer’s yeast, 50 g sucrose, 15g agar, 3 ml propionic 
acid, 3 g p-hydroxy-benzoic acid methyl ester). Progeny of parental crosses were 
group-housed in the dark for 3-7 days after eclosion, before males were transferred 
to food containing 400 1M all-trans-retinal 48 h before the assays”. Single male 
flies were loaded into a courtship arena that contained either a virgin D. simulans 
female, virgin D. melanogaster female or a magnet (radius, 1 mm; height, 1 mm) 
rotating in a circle at 9mm s"! (ref. *°). Upon loading the male fly into the cham- 
ber with the target, we alternated between 2 min of dim light (101x) and 2 min of 
bright light (6 kIx) in a 14-min assay. Dim light was used because it was sufficient 
to enable males to visually track a target object but insufficient to optogentically 
activate the P1 neurons, as evidenced by the lack of courtship towards a magnet 
or D. melanogaster female before bright illumination. Assays were filmed (Sony 
Alpha 6) and later scored for courtship behaviour, binned in 1-s intervals. We 
calculated ‘fraction courting’ as a function of time by dividing the number of males 
courting during a one-second interval (aligned from the start of the assay) by the 
total males tested. Courtship indices were also calculated for each individual at 
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different times relative to the optogenetic stimulation: ‘pre’ represents the courtship 
index of the 2 min before the first bright-light stimulus, ‘stimulus’ represents an 
average of the courtship indices during bright-light illumination period and ‘post’ 
represents an average of the courtship index after the bright-light illumination. For 
the parental controls (Extended Data Fig. 4d-f), we used wt; UAS-CsChrimson. 
mVenus males grown in an identical manner to the experimental flies and similarly 
placed on retinal for 48h. For the non-retinal controls (Extended Data Fig. 4d-f), 
w*; R71G01-Gal4>UAS-CsChrimson.m Venus males were placed in a new vial of 
sugar-yeast food for 48 h before the experiment. To characterize evoked court- 
ship as a function of light intensity (Fig. 4c), each experiment was initiated by 
illuminating for 2 min with dim light (10x) to establish a baseline, then at increas- 
ing intensities of 627-nm LED illumination with 2 min at each intensity, finally 
ending with 2 min of bright white-light illumination. A power meter (Coherent 
PowerMax-USD light sensor and Coherent PowerMax PC Software) was used to 
measure the intensity of 627-nm illumination in the behavioural chamber during 
the assay. To examine how elicited courtship depends on the speed of the magnet 
(Extended Data Fig. 4c), each male was given the opportunity to court a magnet 
moving at 0, 3, 6, 10 and 20mm s_! during bright white-light illumination. Magnet 
speed order was randomized and there were 1-min periods in between stimulus 
trials in which the light was off and the magnet was stationary. 

Targeted mutagenesis and transformation in D. simulans. The protocols 
described below combine methods for CRISPR mutagenesis”. Supplementary 
Table 2 provides details of single-guide RNA (sgRNA) sequences, sgRNA primers 
and sequencing primers. 

CRISPR guide RNAs had an 18-20 nucleotide target sequence and were flanked 
by a3’ PAM sequence (‘NGG’) anda 5’ T7 RNA polymerase recognition sequence 
(‘GG’). Before designing sgRNAs, Sanger sequencing was carried out across target 
genomic sites to identify single-nucleotide polymorphisms. Guide RNA template 
was amplified using KOD HotStart (Millipore, 71086-3) and 0.5 1M forward and 
reverse primers. Reactions were cycled on an Eppendorf MasterCycler (98°C 30s, 
35 cycles of (98°C 10s, 60°C 30s, 72°C 15s), 72°C 10 min, 4°C hold) and then 
purified (PCR purification kit, Qiagen). In vitro transcription of 300ng of ssRNA 
template DNA using T7 MEGAscript kit (Ambion) was carried out at 37°C for 
16-20h. Turbo DNase was added for an additional 15 min at 37°C before adding a 
10% ammonium acetate stop solution. The RNA was phenol/chloroform extracted 
and precipitated with isopropanol at —20°C for 16-20h. Precipitated RNA was 
washed with 70% ethanol, resuspended with RNase-free water, and frozen in small 
aliquots at —80°C for long-term storage. Before injection, the ssRNA was thawed 
on ice and purified using sodium acetate and ethanol before being resuspended 
in RNase free water. 

CRISPR injection mixtures contained 300 ng jl”! recombinant Cas9 protein 
(CPO1, PNA Bio), 40 ng jl’ sgRNA (per guide) and 125ng il” single-stranded 
DNA oligonucleotide. The CRISPR injection mixture was combined on ice and 
placed at —80 to —20°C until the injection. pC-31 mediated recombination injec- 
tion mixtures contained donor plasmid (1 jig jul!) and helper plasmid (1 pg jl~!), 
both of which were purified using endotoxin-free plasmid prep kits (Qiagen). 
Rainbow Transgenic Flies, Inc. performed all injections. 

To generate mutant alleles of ppk23 and Gr32a (Extended Data Fig. 1d), we 
designed sgRNAs targeting three regions spanning 200 bp of the first exon for 
each gene. These sgRNAs were combined into a single cocktail and injected into 
approximately 200 wild-type D. simulans embryos. Only CRISPR guide sequences 
that generated the mutations are listed in Supplementary Table 2. The adult Go 
flies were individually crossed to wild-type male or virgin female flies. For each Go 
cross, we PCR screened 8-16 progeny (F\s) for the presence of an insertion or dele- 
tion. Genomic DNA was extracted from the F, flies by removing a midleg, hindleg 
or wing and placing it into a well of a 96-well plate containing 2011 of lysis buffer 
(10 mM Tris pH 8.2, 1 mM EDTA pH 8.0, 25mM NaCl, 400g ml Proteinase K). 
The fly was then placed in the corresponding well of a 96-well deep-well plate 
(Brandtech VWR, 80087-070) filled halfway with fly food and capped with cotton. 
The 96-well plate containing lysis buffer and fly legs or wings was then heated at 
37°C for 1h followed by a 2-min heat inactivation at 95°C. Genomic DNA (3.2 ul) 
from the reaction was used as the PCR template for a 25-11 reaction of Apex Taq 
Red Master Mix (Genesee Scientific, 42-138) for 35 cycles. The PCR screening 
primers spanned an approximately 400-bp region encompassing the three ssRNA 
target sites. To maximize the resolution of heterozygous indels, we ran the entire 
PCR reaction on a 2% agarose gel at 70 V. Using these specifications, the smallest 
indel we detected was around 20 bp. We backcrossed any flies that had a hetero- 
zygous mutation to wild-type flies and then homozygosed their progeny. Flies 
were Sanger-sequenced to determine if an in-frame stop codon was introduced. 
Homozygous stocks were genotyped and Sanger-sequenced for three generations 
to ensure that the population was pure. 

For recombination into the fru locus, we prescreened sgRNAs to identify those 
that mediate efficient cutting. Nine sgRNAs were designed, six of which targeted 
the region upstream of the first coding exon and three of which targeted the first 
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exon. Pools of three sgRNAs were injected into 100 embryos and genomic DNA 
was extracted from surviving flies. We first used the T7 endonucleasel (T7E1) 
assay for preliminary qualitative analysis of cutting propensity (http://www.crispr- 
flydesign.org/t7-endo-i-assay/). Two positive hits from the T7E1 assays were ana- 
lysed using MiSeq analysis®, which revealed that over 95% of the reads in the PCR 
product were mutated. We used only these two sgRNAs (listed in Supplementary 
Table 2), one targeted to the first exon and one targeted upstream of the start codon 
of the first intron, for generating mutant flies. 

To generate fru“? flies, we integrated in a 200-bp single-stranded oligonucle- 
otide designed to have the minimal 51 bp attP sequence™, including a diagnostic 
restriction enzyme site and around 70 bp homology arms that flanked the CRISPR 
targeted site into the fru locus (Extended Data Fig. 2a). To generate fru/~ flies, 
we integrated a similar attP-containing oligonucleotide into the first exon of the 
Fru™ coding sequence, replacing the ATGATG start site with TTGTTG (Extended 
Data Fig. 2h), as has been reported previously in D. melanogaster'’. The sgRNA, 
attP-oligo and Cas9 protein were injected into around 200 embryos. Gos were 
singly crossed to wild-type virgin flies. F}s with successful integration of the attP 
site were identified by PCR genotyping, isolated and sequenced using methods 
described above. fru“? and fru~/~ F,s were backcrossed to wild-type flies and 
then homozygosed. Homozygous stocks were genotyped for three generations to 
ensure that the population was pure. 

We used (pC31-mediated recombination to integrate attB plasmids contain- 
ing transgenes into the fru*"? locus (Extended Data Fig. 2a). We chose not to 
use eye colour as a visual marker to avoid complications of the white muta- 
tion on behaviour. To determine whether the transgene was homozygous, we 
screened Fjs using the protocol described above for the binary presence of a 
PCR product using one primer pair that spanned the transgene and the genomic 
locus. To create a stable stock of flies, we crossed homozygous virgin females to 
D. simulans males with a balancer allele on their 3rd chromosome (In(3R)Ubx, 
Flybase ID FBab0023784, UCSD Stock Center, 14021-0251.098). Progeny with 
the Ubx visible mutation were crossed together and subsequent progeny were 
genotyped. 

attB-SAS-GFP (Extended Data Fig. 2a) was made by amplifying enhanced green 
fluorescent protein (eGFP) from pUAST-mCD8GFP using primers that attached 
a splice acceptor site’ and Kozak sequence onto the 5’ end of the GFP and an 
SV40 termination sequence onto the 3’ end. Nested-PCR was performed to attach 
Gibson-assembly adaptors onto the GFP PCR product, which was then combined 
with PCR-linearized pHD-DsRed-attP using Gibson assembly (NEB). The plasmid 
was then digested with EcoRI and NotI to insert a 51-bp attB oligonucleotide with 
flanking EcoRI and Not! sites. The double-stranded oligonucleotide was made by 
annealing two single-stranded oligonucleotides together. 

attB-SAS-Gal4 (Extended Data Fig. 2a) was made by cloning attB-SAS and 
Gal4 DNA fragments into pHD-DsRed cut with EcoRI and Spel using Gibson 
Assembly (NEB). The digestion removed 3xP3-DsRed. The attB-SAS fragment 
was amplified from attB-SAS-GFP and the Gal4 fragment was amplified from 
pBPGUw. 

We generated an attP landing site with an inactivated EYFP gene using CRISPR- 
Cas9 mutagenesis. We co-injected embryos of D. simulans strains carrying an attP 
landing site marked with 3XP3::EYFP with p{CFD4-EYFP-3xP3::DsRed}** and 
Cas9 mRNA and sibling-mated surviving adults. We screened for progeny with 
reduced or no EYFP expression in the eyes. Flies with EYFP” were bred to homozy- 
gosity and the 3XP3::EYFP transgene in each line was re-sequenced to confirm 
the presence of the mutation and to confirm that the mutation did not disrupt 
the attP landing site. 

To generate flies expressing GCaMP6s under UAS control, we co-injected 
p{GP-JFRC7-20XUAS-IVS-GCaMP6s}“4 and pBS130 (containing pC-31 integrase 
under control of a heat-shock promoter) into a D. simulans attP, EYFP- strain 
2178 (ref. 3°) and screened for w* integrants. We generated one D. simulans 
UAS-CsChrimson transgenetic line by co-injecting p{20XUAS-IVS-CsChrimson. 
tdTomato}* and pBS130 into attP, EYFP™ strain 2178 and screening for w* inte- 
grants. We generated a second D. simulans UAS-CsChrimson transgenetic line by 
co-injecting a piggyBac vector pBac(20xUAS-CsChrimson.m Venus, 3xp3::dsRed) 
and a piggyBac transposase helper plasmid into the Lethal Hybrid Rescue strain of 
D. simulans and then screened for dsRed expression in the eyes. The CsChrimson. 
mVenus piggyBac insertion was mapped with TagMap”’ to base pair 23,569,712 
on chromosome 3R. 

The D. simulans ppk23-Gal4 plasmids were generated by amplifying the 2.695-kb 
fragment upstream of the D. simulans ppk23 promoter, analogous to previously 
published methods’*. Using a BP-clonase Gateway reaction, the sim-ppk23 
promoter was recombined into pBPGUw (Addgene, 17575). pC31-mediated 
recombination was used to integrate sim into D. simulans attp2176*° (Fig. 3d and 
Extended Data Fig. 3a—d), R25E04-Gal4, 3xp3::DsRed in D. simulans attP2176*° 
(Extended Data Fig. 6e) and pBPGuW R71G01-Gal4 in D. simulans attP2176*° 
(Extended Data Fig. 4a). 
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Immunohistochemistry. To visualize D. simulans fru°’? (Fig. 2a, Extended Data 
Fig. 2b-g), D. melanogaster fru°*4>UAS-GCaMP (Fig. 2a, Extended Data Fig. 2b-g), 
D. simulans ppk23°*"*> UAS-GCaMP (Extended Data Fig. 3b) and R25E04- 
Gal4>UAS-GCaMP (Extended Data Fig. 6e), brains from 1-3-day-old adults were 
dissected in Schneider’s medium for 1h then immediately transferred to cold 1% 
paraformaldehyde (Electron Microscopy Sciences) and fixed for 16-20h at 4°C. 
Samples were then washed in PAT3 buffer (0.5% BSA, 0.5% Triton X-100, 1X PBS 
pH 7.4) three times, with the last two washes incubated for 1h on a nutator at room 
temperature. Brains were blocked in 3% normal goat serum for 90 min at room 
temperature. Primary antibodies in 3% normal goat serum were incubated for 3h at 
room temperature then left at 4°C for 16-20h. Primary antibodies used were 1:20 
mouse anti-Brp (nc82, Developmental Studies Hybridoma Bank), 1:1,000 sheep 
anti-GFP (sim fru”, mel fru©*!4> UAS-GCaMP, ppk23-Gal4>UAS-GCaMP and 
R25E04-Gal4>UAS-GCaMP, Bio-Rad, 4745-1051) and 1:100 rabbit anti-GABA 
antibody (D. simulans fru©*”, Sigma, A2052). Brains were then washed in PAT3 
buffer. Samples were incubated in secondary antibody for 3h at room tempera- 
ture then for 5-7 days at 4°C. Secondary antibodies used were 1:500 anti-sheep 
Alexa Fluor 488, anti-rabbit Alexa Fluor 546, anti-mouse Alexa Fluor 647 and 
anti-mouse Alexa Fluor 555 (ThermoFisher Scientific). Brains were washed in 
PAT3 buffer three times then once in 1X PBS, rotating at room temperature for 
5 min. Samples were mounted in Vectashield (Vector Laboratories) in 5/8th inch 
hole reinforcements placed on glass slides. Images were captured on a Zeiss LSM 
880 using a 40 x objective. 

Two-photon functional imaging. All imaging experiments were performed on an 
Ultima two-photon laser scanning microscope (Bruker Nanosystems) equipped 
with galvanometers driving a Chameleon Ultra II Ti:Sapphire laser. Emitted 
fluorescence was detected with either photomultiplier-tube or GaAsP photodiode 
(Hamamatsu) detectors. Images for ex vivo experiments were acquired with an 
Olympus 60 x, 1.0 numerical aperture objective and in vivo experiments were 
acquired with an Olympus 40x, 0.8 numerical aperture objective (LUMPLFLN). 
All images were collected using PrairieView Software (Version 5.4) at 512 
pixel x 512 pixel resolution with a frame rate from 0.2-0.4 Hz when imaging a 
region of interest (ROI) and 0.7-0.8 Hz when imaging the entire field of view. 
Saline (108 mM NaCl, 5mM KCl, 2mM CaCh, 8.2mM MgCh, 4mM NaHCOs, 
1mM NaH >PO,, 5 mM trehalose, 10mM sucrose, 5mM HEPES pH 7.5, osmolarity 
adjusted to 275 mOsm) was used to bathe the brain for all imaging experiments 
unless otherwise noted. 

To prepare flies for in vivo imaging of Fru* and Ppk23* sensory afferents in 
the ventral nerve cord (VNC; Fig. 3a—d, Extended Data Fig. 3e), the wings and 
all legs except one foreleg were removed from a CO}-anaesthetized male. The 
single-legged male was tethered to a piece of clear packing tape covering a hole in 
the bottom of the modified 35-mm Petri dish using a hair placed across his cervical 
connectives. The body was oriented such that the ventral side faced the inside of 
the dish. A rectangular hole, the length and width of the body of the male fly, was 
cut from the tape and the fly was positioned such that the ventral half of the body 
was placed above the plane of the tape. Great care was taken to ensure that the 
foreleg was extended so that the tibia and femur did not contact the thorax. Small 
dots of UV-curable glue were used to secure the eyes, part of the thorax and the 
tip of the abdomen to the tape. The dish was then filled with saline and the cuticle 
covering the first thoracic ganglion was gently removed, taking care to not damage 
the foreleg nerve. The preparation was positioned on the two-photon microscope 
and an ROI was centred on the most ventral portion of the first thoracic ganglion 
of the VNC corresponding to the intact foreleg (Fig. 3a). To prepare a stimulating 
female abdomen, a pin was attached to the dorsal thorax of virgin female 
D. melanogaster or D. simulans fly with her head, wings and legs removed so that 
the abdomen could make contact with the distal tarsal segments of the foreleg 
of the male fly. To guide stimulation with the fly abdomen, an 850 nm IR light 
was used to illuminate the chamber. The fly was imaged from the side using a 
Point Grey Firefly camera mounted with a 1 x-at-94 mm Infinistix lens fitted 
with a shortpass IR filter (850 nm, optical density 4, Edmund Optics) to block 
925 nm two-photon laser illumination and viewed using FlyCapture2 Software 
(2.12.3.2). After recording a 10-s baseline, the experimenter gently tapped the 
female abdomen onto the tarsi of the experimental fly once every 10s for 6-8 bouts. 
Three replicates per preparation (total 18-24 tapping bouts) were conducted with 
D. simulans and D. melanogaster stimuli interleaved. 

Images and quantification of Ppk23* soma in the foreleg of the male were 
completed using a Zeiss Axioplan 2 scope under Nomarski optics and widefield 
fluorescence at 40 x or 63 x. Images were acquired through a Zeiss AxioCam and 
the Axiovision software (Extended Data Fig. 3a). Somata were counted only in the 
first three tarsal segments of the foreleg. 

We modified published methods!*”! for in vivo imaging of Ppk23+ soma in 
the foreleg (Extended Data Fig. 3f). Male ppk23-Gal4>UAS-GCaMP flies were 
isolated as virgins, aged 3-6 days, CO3-anaesthetized, decapitated and immobilized 
by folding a piece of Parafilm over the body such that the first five tarsal segments 


extended out of the Parafilm. The immobilized fly was placed on a glass coverslip 
for imaging using a monochromatic camera (Point Grey Research, Flir Chameleon 
3). Pheromone was presented as follows: 1,11 of 7,11-HD or ethanol was pipetted 
onto a paper wick (Hampton Research) that had been trimmed such that one 
constituent fibre was exposed at the tip. Using a micromanipulator, the wick was 
brought into contact with one chemosensory sensillum on the 3rd tarsal segment 
of the foreleg. GCaMP responses were visualized using a 50x air objective using 
488-nm LED illumination on a bright-field microscope (Scientifica). AF/F values 
were calculated using ImageJ as the maximum signal in the 30s after pheromone 
presentation in accordance with published methods!**!. Without more precise 
genetic tools in D. simulans, we defined soma A as the soma that responded more 
strongly to 7,11-HD presentation, in accordance with previous work", To demon- 
strate that the response of soma A was specific to the pheromone, we also presented 
the ethanol vehicle in which ethanol alone was adsorbed to a wick. The range of our 
maximum AF/F values for 7,11-HD stimulation were consistent with previously 
published results in D. melanogaster!??8, 

To prepare flies for in vivo imaging of the LPC, P1 neurons and mAL neurons in 
the central brain using both frucalt and R25E04-Gal4 neural drivers (Figs. 4d-h, 5d, 
Extended Data Figs. 4g-j, 6g—h), CO2-anaesthetized 4-7-day-old males were 
affixed to a customized, milled tethering plate*? using UV-curable glue around 
their head and thorax. Glue was cured in short bursts to minimize exothermic 
damage to the preparation and flies whose legs touched the glue were discarded. 
The proboscis was glued to the head, carefully avoiding the antennae, to minimize 
movement of the brain during imaging. Flies were given an hour to recover and 
were used only if they displayed vigorous activity post-tethering. A small hole 
in the head was opened under external saline using sharp forceps. Muscle 16, 
obstructing trachea, and fat were removed. The imaging plate had magnets inside 
to enable facile positioning under the 40x objective in the two-photon micro- 
scope. Using a micromanipulator, a styrofoam ball” floating on an air stream was 
positioned under the fly to permit standing and walking. A D. melanogaster or 
D. simulans virgin female tethered to a pin (see above for tethering details) was 
positioned in front of the tethered male using a micromanipulator. To stimulate 
tapping events, the female was moved in front of the male fly, who then freely 
tapped on her abdomen with his foreleg tarsi. The male fly was imaged from the 
side (see above) to facilitate positioning the ball and the abdomen stimulus during 
the experiment. After 4 to 5s of baseline recording, the stimulus fly was presented 
to the tethered male for 2-5s, allowing for several taps before being withdrawn. 
This was repeated nine times for each fly stimulus with D. melanogaster and 
D. simulans stimuli interleaved. An ROI was centred on the LPC or on the fascic- 
ulated projections from P1 neuron cell bodies to the LPC. We vetted our ability to 
reproducibly identify the characteristic processes of P1 neurons by first imaging 
them using R71G01-Gal4>UAS-GCaMP in D. melanogaster males (Extended Data 
Fig. 4g), which showed robust and specific responses in all parts of the P1 neurons 
to the taste of a D. melanogaster female. When imaging the fasciculated projections 
of P1 neurons, our field of view contained both the LPC and the P1 projections, 
enabling us to simultaneously record responses in the LPC and P1 neurons when 
the male tapped a female (Fig. 4g, h). We attempted to use the R71G01-Gal4 driver 
for functional imaging of the P1 neurons in D. simulans, but observed no response 
to the taste of either a D. melanogaster or D. simulans female. Although this is 
consistent with the lack of pheromone-mediated excitation that we observed when 
imaging all Fru* neurons in the LPC or Fru* P1 neurons in D. simulans, we could 
not rule out that the lack of responses was due to weak expression of GCaMP. 
Notably, we observed pheromone responses using driver lines with similar levels 
of expression, such as R25E04-Gal4, in both D. melanogaster and D. simulans. 

For experiments in which picrotoxin was used to block inhibition (Extended 
Data Fig. 6h), in vivo responses were recorded in the LPC before and after ion- 
tophoresis of picrotoxin unilaterally into the LPC (1 mM in saline, 3-5 pulses, 
100 ms at 20 V). Local injection of picrotoxin had no noticeable effect on the behav- 
iour of the male fly or on baseline fluorescence of the LPC, in contrast to bath 
application of picrotoxin (101M and 100,1M), which caused seizures in the fly 
and a marked, fluctuating increase in baseline fluorescence of the LPC (data not 
shown). Iontophoresis of saline had no effect on pheromone-evoked responses in 
either species (data not shown). Picrotoxin iontophoresis was based on previously 
published methods*!, We did not attempt picrotoxin iontophoresis with D. simulans 
R71G01-Gal4 because we could not confidently identify the LPC owing to weak 
expression of GCaMP. 

To prepare flies for in vivo imaging of vAB3 (Fig. 5c, Extended Data Fig. 6a-d), 
2-5-day-old male flies were briefly anaesthetized using CO, and then tethered used 
a previously described preparation®? in which the male was affixed to a piece of 
tape covering a hole in the bottom of a modified 35-mm Petri dish using human 
hair placed across the cervical connectives. A small strip of tape was placed over 
the proboscis of the fly and two pieces of putty were placed next to the thorax to 
prevent the legs from becoming stuck onto the tape. A small hole above the head 
was precisely cut into the tape and the head was secured using two small dots of 
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Uv-curable glue that bridged the eyes and the tape. The dish was filled with saline 
and the head capsule was opened by carefully tearing off the flap of cuticle covering 
the dorsal portion of the head and removing any obstructing trachea and fat. The 
dish was placed under the microscope and the axonal tract of vAB3 projecting from 
the SEZ to the LPC was identified. We vetted our ability to reproducibly identify the 
characteristic morphology of vAB3 by first imaging the vAB3 axonal tract using 
AbdB-Gal4>UAS-GCaMP in D. melanogaster males (Extended Data Fig. 6a, b). 
Baseline fluorescence was recorded for 4 before a female abdomen was presented 
to the male for him to tap. Trials were repeated three times for each female species 
and then AF/F responses were averaged. 

Ex vivo stimulation of vAB3 (Fig. 5e, f, Extended Data Fig. 7) was performed 
as previously described”. A Grass stimulator was used to iontophorese acetyl- 
choline (10 V, 200 ms) through a fine glass electrode positioned on the axons of 
the Ppk23* sensory neurons in the VNC (Fig. 5e, f, Extended Data Fig. 7). The 
stimulating electrode was filled with 10mM acetylcholine, 10mM glutamate or 
saline and Texas-Red Dextran BSA to facilitate positioning the electrode in the 
Fru* neuropil. The local nature of the stimulation combined with the anatomically 
segregated sensory innervation of the Ppk23* sensory neurons in the VNC facili- 
tated restricted and reproducible stimulation. To functionally visualize responsive 
neurons in the brain, we imaged a z-plane every 5,1m and combined these to build a 
volume of the anterior around 100 1m of the brain. For quantitative comparisons of 
vAB3-evoked activity in specific neural populations across individuals (Fig. 5f, 
Extended Data Fig. 7), single z-planes were recorded using a 40 x objective at 
2x zoom with an ROI of 300 x 300 pixels. Given that P1 soma and fasciculated 
processes reside on the posterior side of the brain, when imaging P1 and vAB3 
neurons in response to vAB3 stimulation, we rotated the brain 180° around the 
cervical connectives. 

For two-photon severing of mAL (Fig. 5f, Extended Data Fig. 7), the brain was 
pinned ventral side up and we focused 925-nm light on a small ROI encompassing 
only the mAL axon tract at 8x optical zoom. The mAL axon tract could be readily 
identified by its characteristic morphology. For two-photon severing of vAB3 or 
a mock Fru* neuron (Extended Data Fig. 7c), the VNC and brain were pinned 
ventral side up. We validated that vAB3 axons could be reproducibly identified 
within the VNC by performing initial experiments in AbdB-Gal4>UAS-Tomato/ 
fru'4>LexAOP-GCaMP D. melanogaster males in which vAB3 neurons are ana- 
tomically marked by Tomato expression. We found that vAB3 axons were always 
robustly activated by acetylcholine iontophoresis and have a characteristic posi- 
tion within the ventral cord that enabled their identification even in the absence 
of an anatomic marker. We focused 925-nm light on a small ROI encompassing 
either the vAB3 axon tract or the tract of a Fru* neuron more lateral than vAB3. 
We then switched the laser to 850nm and imaged using short (<1) pulses until 
a cavitation bubble was observed. After switching back to 925 nm and zooming 
out, if the axon tract was successfully severed, we observed a marked increase in 
baseline fluorescence due to Ca”* influx into the neurons and activating GCaMP. 
Because vAB3 neurons project bilaterally, we also severed the corresponding axon 
tract on the contralateral side of the brain. To image P1 neurons after severing 
mAL, we re-pinned the brain such that the dorsal side of the brain and the ven- 
tral side of the VNC were facing up, rotating around the cervical connectives, 
inserted the stimulating electrode in the VNC and recorded activity in P1 neurons 
and vAB3 neurons. vAB3 activation was not affected by mAL severing 
(data not shown). 

Dye-filling of neural tracts using Texas-Red Dextran (100 mg ml 1, Invitrogen) 
was performed as previously described™. For dye-filling specific neural popula- 
tions, we targeted the fasciculated bundle of P1 neurons projecting from the somata 
(Fig. 4a, h, Extended Data Fig. 5a), the segregated vAB3 terminals in the VNC 
(Fig. 5b, Extended Data Fig. 5b) and the characteristic mAL axonal bundle pro- 
jecting between the SEZ and LPC (Fig. 5b, Extended Data Fig. 5c) with a dye-filled 
electrode. To photolabel neurons, we expressed sPA-GFP in Fru* neurons, located 
the neural structure of interest using 925-nm laser illumination, a wavelength 
that does not cause substantial photoconversion, defined an ROI in PrairieView 
software in a single z-plane, and exposed the target area to 710-nm light (around 
10-30 mW at the back aperture of the objective) 100-300 times over the course 
of several rounds of photoactivation. We allowed diffusion of the photoconverted 
fluorophores throughout the targeted neurons for 30-60 min and then imaged 
at 925nm. All anatomical images are maximum projections of z-stacks with 
1-\um steps. Autofluorescence from the glial sheath and basal fluorescence from 
non-dye-filled structures were masked for clarity. 

Unless stated, anatomical images were acquired on the 2P microscope using 
standard techniques. 
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Imaging analysis. To analyse functional responses, we calculated AF/F for each 
frame of calcium-imaging time-courses, using the second to sixth frames as the 
baseline. The peak AF/F value was defined as the maximum value within the time 
during which the stimulus was presented, which could vary due to inconsistent 
tapping rate of a male fly. Heat maps (AF) were generated using the maximum 
projection of two frames of baseline subtracted from the maximum projection 
of the two frames encompassing the peak fluorescence in response to a stimulus. 
Arbitrary units (AU) correspond to 1/100th of the minimum displayed value and 
maximum displayed value in the display range set in FIJI. 

Statistics and reproducibility. For courtship indices, each data point represents 
a biological replicate of an individual male’s courtship behaviour. For in vivo and 
ex vivo functional assays (Figs. 3b-d, 4e, f, 5c, d, f; Extended Data Figs. 3f, 6b, c, h, 7c, e); 
each pair of dots represents the average response for a given preparation (biological 
replicate). In Fig. 4g, h and Extended Data Figs. 3e, 4h-j, 6d, g, 7c-g, individual 
stimulations were plotted. Each representative example was replicated more than 
five times in independent experiments (Figs. 2a, 3a—c, 4a, e-h, 5b, e; Extended Data 
Figs. 2b-g, 3a-d, 4a, g, 5, 6a, e, 7a-c and Supplementary Videos 1-5). Sample sizes 
were based on pilot experiments. For all behavioural experiments n > 15, and for 
all in vivo imaging experiments n > 5. We used the PRISM software package to 
graph and statistically analyse data. Before statistical analysis, we tested whether the 
values were normally distributed using D’Agostion—Pearson omnibus and Shapiro- 
Wilk normality tests. When data were normally distributed, we used parametric 
tests; when data were not normally distributed, we used non-parametric tests. We 
adjusted P values accordingly when multiple comparisons were conducted. See 
Supplementary Table 1 for more details on statistical tests and P values. 
Reporting summary. Further information on experimental design is available in 
the Nature Research Reporting Summary linked to this paper. 

Data availability. Datasets generated during the current study are available from 
the corresponding author on reasonable request. 
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Extended Data Fig. 1 | Pheromone regulation of D. simulans courtship. 
Mutant males and males lacking foreleg tarsi still court, but display altered 
courtship preferences. a, Courtship indices of males with foreleg tarsi 
intact (+) or surgically removed (—). Data are replotted from Fig. 1b. 

b, c, Courtship indices of D. melanogaster (b) and D. simulans (c) males 
with either foreleg tarsi or rear leg tarsi ablated towards D. melanogaster 
or D. simulans females. d, Schematic of CRISPR—Cas9-induced mutations 
(top) in Gr32a (left) and ppk23 (right) gene loci. Cas9 was targeted 

by gRNA to the first exon (cut site) of Gr32a or ppk23 resulting in a 

36-bp insertion/2-bp deletion in the Gr32a coding sequence and 90-bp 
insertion into the ppk23 coding sequence. Both indels generated in- 
frame stop codons (bottom, asterisk highlighted red in resulting amino 
acid sequence). Forward (F) and reverse (R) genotyping primers are 
marked with a line. e, Courtship indices (CI) towards females of different 


Drosophila species by wild-type (WT), Gr32a~/~ and ppk23~/~ D. simulans 
males. The major pheromone carried by each female is listed. f, Courtship 
indices of D. simulans males towards D. melanogaster and D. simulans 
females in preference assays. Data are replotted from Fig. 1d. g, Courtship 
indices of D. simulans males towards D. simulans females perfumed with 
7,11-HD (7, green) or ethanol (E, EtOH, blue). Data are replotted from 
Fig. le. The statistical test performed in e was a Kruskal-Wallis test, and 
different letters mark significant differences. Data are mean and s.d., with 
individual data points shown. Lines connect courtship indices of the same 
male towards the different female targets in a preference assay. Because the 
male can only court one female at a time, the paired points are inherently 
interdependent on each other, therefore inappropriate for statistical 
analysis. See Supplementary Table 1 for details of statistical analyses. 
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Extended Data Fig. 2 | Anatomic and functional conservation of 
Fru* neurons. a, Left, schematic of chromosomal location of fru"? 

and fru~'~ integration sites in D. simulans and previously generated 
fruS*" and fru’ transgenes in D. melanogaster; right, schematic of attP 
oligonucleotide integrated into the fru intron to generate fru” allele and 
subsequent integration of attB plasmids (right). ExF and ExR are primers 
located in the genome and InR is a primer located inside the transgene. 
b-g, Maximum intensity confocal (b-d) and two-photon stacks (e-g) 

of anatomically defined regions of Fru* neuropil in D. melanogaster 
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frusets UAS-GCaMP and D. simulans fru©!? males: LPC (b, c), SEZ (d), 
antennal lobe (e), lateral horn (f) and mushroom body 4-lobes (g). 

Scale bars, 101m. h, D. simulans fru~'~ was generated by integrating an 
oligonucleotide that deleted codons 1 and 2 of the first exon, introducing 
a frame-shift mutation. i, Male-male chaining indices of wild-type (+/+) 
and fru"! ~ males. A paired t-test was used, data are mean and s.d. and 
individual data points are shown. See Supplementary Table 1 for details of 
statistical analyses. 
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Extended Data Fig. 3 | Conserved anatomy and functional tuning 

of Ppk23*Fruitless* foreleg sensory neurons. a—c, ppk23 promoter 
expression in D. melanogaster and D. simulans males in forelegs (a), ventral 
nerve cord (VNC, b) and brain (c). GFP, green; differential interference 
contrast, grey; neuropil counterstain, magenta. a, Forelegs (top left) with 
paired soma (bottom left) and quantification of number of Ppk23* sensory 
neuron soma in the first three tarsal segments of the foreleg (right). 

d, Ppk23 neuron innervation in the first thoracic ganglion of the VNC 

of D. simulans females. Ppk23* sensory neurons display a characteristic 
sexually dimorphic expression pattern in the VNC, where they do not 
cross the midline in females, but do in males. e, Schematic of VNC 
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imaging preparation (left); functional responses evoked by individual taps 
of a female abdomen in Frut neurons (middle) and Ppk23* neurons 

(far right) in the VNC of wild-type and ppk23~’~ D. melanogaster and 

D. simulans males. Data replotted from Fig. 3b-d. f, Schematic of Ppk23* 
somatic imaging preparation (left) and functional responses of the paired 
neurons (cell A and B, see Methods) within a sensory bristle stimulated 
with 7,11-HD or ethanol (middle); comparison of 7,11-HD responses 

in Ppk23+ soma across species (far right). Statistical tests used were an 
unpaired t-test (a), a Kruskal-Wallis test, for which different letters mark 
significant differences (e), and paired and unpaired t-tests (f). Scale bars, 
10m. See Supplementary Table 1 for details of statistical analyses. 
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Extended Data Fig. 4 | Behavioural and functional analysis of P1 
neurons. a-f, Anatomy (a) and optogenetic behavioural manipulations 
(b-f) of P1 neurons using R71G01-Galé4 to drive the expression of 
CsChrimson. b, Courtship indices (top, right) towards a rotating magnet 
by D. melanogaster males pre-, during, and post-P1 neuron optogenetic 
stimulation. Fraction of male flies courting (bottom; grey boxes indicate 
bright light illumination, see Methods). c, Courtship indices towards 

a magnet moving at different speeds during optogenetic P1 neuron 
stimulation in D. simulans males. d-f, Comparison of courtship indices 
towards magnet (d), D. simulans female (e), or D. melanogaster female (f) 
by D. simulans males of denoted genotypes, either fed or not fed retinal. 


g, Experimental set-up used to measure pheromone responses in the P1 
neurons in vivo (top left) and overlay of the Fru* (green) neurons and 
fasciculated P1 neuron processes (magenta). The white boxes indicate the 
approximate ROI used to image P1 responses. h-j, Functional responses 
evoked by individual taps of a female abdomen in P1 neurons (h) and 
Fru* neurons in the LPC (i, j) of wild-type and ppk23~/~ D.melanogaster 
and D. simulans males (data replotted from Fig. 4e, f). Statistical tests used 
were the Kruskal-Wallis test (b, df, h-j) and a one-way ANOVA (c). Data 
are mean and s.d., with individual data points shown. See Supplementary 
Table 1 for details of statistical analyses. 
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Extended Data Fig. 5 | Anatomy of P1, vAB3 and mAL neurons in just dye-filled neurons (black, middle-right) and photoactivated neurons 
D. simulans and D. melanogaster males. a—c, Detailed anatomic images in D. melanogaster LPC (black, right). d, Antibody staining of D. simulans 
of P1 neurons (a), vAB3 neurons (b) and mAL neurons (c). Schematic Frut neurons (anti-GFP, green) with anti-GABA (red) in the SEZ and the 
of neural anatomy (left), Texas-Red dextran dye-fill (red) in D. simulans LPC demonstrating that mAL neurons are GABAergic and thus inhibitory. 
fru? (green) males (middle-left), magnified view of labelled neurons in Scale bars, 101m. 


the LPC showing dye-filled neurons (red) and Frut neurons (green) or 
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Extended Data Fig. 6 | Pheromone responses in central neurons of 

D. melanogaster and D. simulans males. a, Schematic of in vivo 
preparation used to measure pheromone responses in vAB3 processes in 
the brain (top). Representative fluorescence increase of vAB3 responses 

in a D. melanogaster male evoked by tapping a D. melanogaster female 
(bottom left). GCaMP was expressed in vAB3 neurons using the AbdB- 
Gal4 driver. Anatomy of fasciculated vAB3 processes co-labelled by fru 
(green) and AbdB-Gal4 (magenta) in the same in vivo preparation used for 
imaging (right). The white box indicates the approximate ROI analysed 
for functional imaging. b-d, Functional responses evoked by the taste 

of female pheromones in the vAB3 processes of wild-type and ppk23~/— 
males. b, Functional responses evoked by individual taps in vAB3 neurons 
in D. melanogaster labelled using AbdB-Gal4. c, Average responses of 
vAB3 neurons in ppk23~/~ D. melanogaster and D. simulans males in 
response to the taste of D. melanogaster (m) and D. simulans (s) females. 
GCaMP was expressed in vAB3 neurons using fru°"*. d, Functional 
responses evoked by individual taps in vAB3 neurons in wild-type and 


ppk23~’~ mutant males. Data replotted from Fig. 5c and Extended Data 
Fig. 6c. e, Expression of 25E04-Gal4>UAS-GCaMP (green) with neuropil 
counterstain (magenta) in the brains of D. melanogaster (left) and 

D. simulans (right) males. f, Courtship indices towards conspecific females 
during optogenetic activation of mAL neurons in D. simulans males with 
parental controls. g, Functional responses evoked by individual taps in 
mAL neurons. Data replotted from Fig. 5d. h, Average AF/F responses 

in Fru* neurons of the LPC evoked by the taste of a female before and 
after local injection of picrotoxin, a GABA, receptor antagonist, into 

the LPC. In males of both species, application of picrotoxin increased 
responses only to D. melanogaster female stimuli. Lines connect average 
functional responses in the same male towards the different female targets. 
Statistical tests used were the Kruskal-Wallis test, in which different letters 
mark significant differences (b, d, f, g), and the paired t-test (c, h). Data 
are mean and s.d., with individual data points shown. See Supplementary 
Table 1 for details of statistical analyses. 
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Extended Data Fig. 7 | Functional responses of Fru* neurons to 
direct vAB3 stimulation in D. melanogaster and D. simulans males. 

a, Schematic and representative image depicting direct stimulation of 
vAB3 neurons by iontophoresis of neurotransmitter onto their dendrites 
within the VNC. Maximum intensity z-projection (top) and single z-plane 
(bottom) showing electrode placement in the VNC. The electrode is filled 
with neurotransmitter and Texas-red dye (red) to enable precise targeting 
in the Fru* neuropil (green). b, Representative multi-plane fluorescence 
increase of Fru™ brain neurons in D. simulans males when the VNC is 
stimulated with acetylcholine (left), glutamate (middle) and saline (right) 
iontophoresis. c, To test the necessity of vAB3 in propagating signals 

from the VNC to the higher brain, we compared response profiles in the 
brain before severing vAB3 (black), after severing a nearby Fru* axon 
(mock control, blue) and then after severing vAB3 axons (orange) 

in D. melanogaster males (top) and D. simulans males (bottom). 
Representative multi-plane fluorescence increase of Fru* neurons (left) 
shows the loss of evoked functional responses in the brain after severing 
vAB3. The graph (middle) depicts the relationship between average AF/F 
responses in vAB3 and mAL. Average AF/F responses of vAB3 (green) 


and mAL (red) neurons evoked by vAB3 stimulation, before and after 
severing vAB3 (right). Responses were lost in both neural populations in 
both species after vAB3 was severed but not in the mock control. d, The 
relationship of functional responses in the SEZ and LPC of D. melanogaster 
and D. simulans males evoked by direct vAB3 stimulation. The dots on the 
graph represent different stimulation intensities and the trend lines relate 
the responses of individual D. melanogaster (green) and D. simulans (blue) 
males. e, Left, the response of vAB3 (top) and mAL (bottom) axonal 
tracts in response to vAB3 stimulation in D. melanogaster (green) and 

D. simulans (blue) males. Coloured lines represent single stimulations and 
black lines represent the average. Right, peak AF/F plotted with coloured 
dots representing the average response per fly and data are mean and 

s.d. Individual data points are shown. f, g, Comparison of vAB3-evoked 
responses in D. melanogaster and D. simulans males in vAB3 neurons (f) 
and P1 neurons (g) before (+) and after (—) mAL severing. Statistical 
tests used were the Kruskal-Wallis test, in which different letters mark 
significant differences (c, f, g), and an unpaired t-test (e). Scale bars, 
101m. See Supplementary Table 1 for details of statistical analyses. 
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Cryo-EM structure of a fungal 
mitochondrial calcium uniporter 


Nam X. Neuyen!, Jean-Paul Armache*, Changkeun Lee!*®, Yi Yang!*, Weizhong Zeng’, Vamsi K. Mootha’, Yifan Cheng’, 


Xiao-chen Bai*>* & Youxing Jiang!?* 


The mitochondrial calcium uniporter (MCU) is a highly selective calcium channel localized to the inner mitochondrial 
membrane. Here, we describe the structure of an MCU orthologue from the fungus Neosartorya fischeri (Nf{MCU) 
determined to 3.8 A resolution by phase-plate cryo- electron microscopy. The channel is a homotetramer with two-fold 
symmetry in its amino-terminal domain (NTD) that adopts a similar structure to that of human MCU. The NTD assembles 
as a dimer of dimers to form a tetrameric ring that connects to the transmembrane domain through an elongated coiled- 
coil domain. The ion-conducting pore domain maintains four-fold symmetry, with the selectivity filter positioned at the 
start of the pore-forming TM2 helix. The aspartate and glutamate sidechains of the conserved DIME motif are oriented 
towards the central axis and separated by one helical turn. The structure of Nf{MCU offers insights into channel assembly, 


selective calcium permeation, and inhibitor binding. 


Mitochondria can take up large amounts of calcium from their envi- 
ronment, which can modulate ATP production, alter cytoplasmic Ca*+ 
dynamics, and trigger cell death!”. Calcium enters the mitochondrial 
matrix through the mitochondrial calcium uniporter, a highly selective 
calcium channel that is localized to the inner mitochondrial mem- 
brane’. In humans, the uniporter is a protein complex, or ‘uniplex} 
consisting of at least four components: the ion conducting pore 
(MCU)*®, an essential membrane spanning subunit (EMRE)’, and 
peripheral membrane gate-keeping proteins (MICU1 and MICU2)”*. 
Whereas MCU is found in all major eukaryotic taxa, EMRE is meta- 
zoan-specific and is required for the conductivity of MCU in these 
organisms®. By contrast, the MCU homologue in Dictyostelium, whose 
genome lacks EMRE, is sufficient to operate as a pore-forming chan- 
nel'®. Uniporter activity has diverged in fungi, even being lost entirely 
in certain lineages such as Saccharomyces cerevisiae''. In most fungi, 
genome sequence analysis indicates that the pore-forming MCU is 
the only component of the uniporter’” and is likely to represent the 
minimal channel component of the uniporter for calcium uptake. A 
recent study used functional mutagenesis to demonstrate that the MCU 
orthologue from the fungus Aspergillus fumigatus can mediate Ca* 
transport into mitochondria’. 

Notably, MCU exhibits no discernible sequence homology to other 
cation channels!?, making it difficult to predict its structure and func- 
tion using computational tools. Although the structures of various iso- 
lated MCU domains or the auxiliary component of the uniplex have 
been reported—the NTD of human MCU!*"!5, the pore domain of an 
N-terminal deletion of the C. elegans MCU (CMCU-ANTD)"®, and 
human MICU1!7—fundamental questions about the channel's assem- 
bly, gating, and ion permeation remain unanswered. The recent NVR 
structure of CMCU-ANTD revealed a pentameric channel pore with 
an occluded ion pathway, and EMRE was suggested to be the key to 
driving the pore from being in the occluded to an open, ion-conductive 
state!®; however, there are no structural data on metazoan MCU 
and EMRE together. Mounting evidence for the pathophysiological 


relevance of the uniporter complex’'* has provided further impetus 
for determination of the structure of the intact channel. To address 
these longstanding questions, we determined the cryo-EM structure 
of an MCU homologue from N. fischeri (Nf{MCU) and also performed 
in-depth functional studies on this channel. The structure defines the 
architecture of the channel subunit of the MCU and provides insights 
into channel assembly and function. 


Biochemistry and structure determination 

The architectural and functional complexity of metazoan MCU led us 
to take a reductionist approach and to characterize MCU orthologues 
from fungi, whose genomes encode only the ion-conducting pore sub- 
unit”, with the goal of understanding the basic principles that govern 
channel assembly and function. Among those tested, the 488-residue 
wild-type MCU orthologue from N. fischeri, N{MCU, showed the best 
biochemical properties in terms of protein expression and stability and 
was modified for structural studies (see Methods). Unlike the pen- 
tameric CMCU-ANTD from the NMR study'®, NfMCU purified as a 
tetramer in solution (Extended Data Fig. 1). Additionally, it conferred 
uniporter function to bacteria, and Nf{MCU proteoliposomes reca- 
pitulated the voltage-dependent Ca”* uptake property of the MCU” 
(see Methods and Extended Data Fig. 2). 

Determination of the structure of Nf{MCU using single particle 
cryo-EM was technically challenging for the following reasons: small 
protein size of 180kDa for a channel tetramer; symmetry mismatch 
between the pore and soluble domains; and an elongated shape with 
minimal features. In addition, detergent-solubilized NfMCU aggre- 
gated when frozen and NfMCU in amphipols exhibited orientation 
bias. We first overcame the stability issue by reconstituting the chan- 
nel into nanodiscs but, owing to the channel's flexibility in nanodiscs, 
whose size is much larger than the transmembrane part of the channel, 
we could only determine the structure to around 15 A resolution. This 
led us to attempt reconstitution using the newly developed scaffolding 
protein saposin (see Methods)”°, which we rationalized could adapt its 
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Fig. 1 | Cryo-EM structure of Nf{MCU. a, The overall cryo-EM map 
of Nf{MCU with density for saposin removed for clarity. b, Overall 
architecture of NfMCU, coloured by subunit and divided into major 
structural domains. c, Each NfMCU tetramer assembles as a dimer 


oligomerization to maintain Nf{MCU in an artificial lipid bilayer that is 
proportional to the channel’s smaller transmembrane domain. NEMCU 
reconstituted in saposin yielded a marked improvement in the structure 
to an overall resolution of 4.6 A; however, several parts of the channel 
were poorly resolved, probably owing to inaccurate particle alignment 
stemming from the inherently low contrast of small particles. Finally, 
we used a Volta phase plate to image the NfMCU saposin complex, 
which ultimately allowed us to determine the structure to an overall 
resolution of 3.8 A according to the gold-standard Fourier shell corre- 
lation (FSC) =0.143 criterion (Extended Data Fig. 3, Extended Data 
Table 1 and Methods). 

Consistent with our biochemical analysis of Nf&MCU in detergent, 
the EM map unambiguously showed tetrameric assembly of NfMCU 
(Fig. 1a). The map was of sufficient quality for model building of major 
parts of the protein (Extended Data Fig. 4 and Methods). However, 
parts of the N and C termini, as well as the long loop in the NTD, were 
disordered and were not modelled in the final structure (see Methods). 
The EM map also revealed clear electron density for the 81-residue 
saposin, which we modelled as poly-alanine (Extended Data Fig. 5). 
Six saposin molecules—each forming a C-shaped clamp oriented at an 
angle of approximately 45° to the central axis—encircle the lipid bilayer 
to stabilize NfMCU. Thus, our structure also reveals how saposin 
oligomerizes to maintain a membrane protein in lipid. 


Overall architecture of N[MCU 

NfMCU forms an elongated, three-tiered channel tetramer with each 
tier corresponding to a structural domain: the N-terminal domain 
(NTD), coiled-coil domain (CCD) and transmembrane domain (TMD) 
(Fig. 1b, Extended Data Fig. 6). Although NfMCU is tetrameric, it tran- 
sitions from being four-fold symmetric at the TMD to two-fold sym- 
metric in the NTD, which forms a dimer of dimers (Figs. 1c, 2a). Unlike 
the NMR structure of cCMCU-ANTD, which lacks the NTD and adopts 
a constricted architecture, Nf{MCU is architecturally open, with large 
fenestrations in the space defined by the CCD (Fig. 1b, Extended Data 
Fig. 7a). Sitting atop the NTD, the CCD resembles pillars supporting 
the TMD. The tight packing among the four NTD subunits, as well 
as the exceptionally long CC1 helix that extends into TM1, dictates 
the spacing among the four CCDs, thereby, creating a highly solvent- 
accessible space below the ion conduction pathway. 

There are several major differences between the structures of 
NfMCU and cMCU-ANTD (Extended Data Fig. 7a, b). In addition to 
their oligomeric states and overall architecture, the secondary structural 
elements between the two structures are also quite different. The CC1 
and TM1 form a single extended helix in Nf{MCU but are comprised 
of five short, broken helices in CMCU-ANTD (Extended Data Fig. 7b). 
These structural differences are unlikely to be caused by the different 
origins of the two channels, as the CCD, TMD and selectivity filter are 
highly conserved among MCU orthologues’? (Extended Data Fig. 6). 
We suspect that the missing NTD and the use of a denaturing detergent 


ARTICLE 


c d ~ 
ane = Py $ ° s 
C 25 TMi TMS o™1 
fs 5 8 Ss 2 
S 2-8 mus $ 
‘ I V 
JM 2 Scot i it 3 
aso ‘Sv 6ecc1 
Se “Ie 
e S a4 
cove (Po cc? Wc 
5 Ty 02 
a2 
N 
ot NID of 
N 
Protomer A Protomer B Protomer A ~~ Protomer B 


of dimers. Each dimer consists of two protomers adopting distinct 
conformations. d, Structural comparison of the two protomers in each 
MCU dimer. Dashed circles highlight the distinct conformation of the 
juxtamembrane loop (JML) in the two protomers. 


such as fos-choline to extract and purify protein from inclusion bodies 
in the NMR study contributed to these discrepancies. 


Luminal soluble domains 

The NTD and CCD constitute the luminal soluble domains of N{MCU. 
Despite the poor sequence conservation of the NTD among MCU ort- 
hologues’” (Extended Data Fig. 6), the structures of the NTDs from 
NfMCU and human MCU are strikingly similar'*’°. Consisting of 
seven 3-strands (31-7) and two helices («1 and 2), each NTD forms a 
barrel enclosed by the seven-stranded }-sheet and the al helix, with 
the «2 helix sitting atop the barrel (Extended Data Fig. 7b). Four NTDs 
form a well-packed tetrameric ring in a dimer of dimers configura- 
tion with two-fold symmetry (Fig. 2a). Inter-subunit interactions are 
defined by two types of interface. Interface I involves primarily hydro- 
gen bonding between main chain atoms on (5 of one subunit and 83 
of a neighbouring subunit (Fig. 2b), whereas interface II is stabilized 
by hydrogen bonding between polar side chains (Fig. 2c). A similar 
dimerization at interface I is also observed in the crystal structure of 
HsMCUnyrp (Extended Data Fig. 7c), thus highlighting the structural 
conservation between NfMCU and human MCU. The tetrameric 
assembly of the NTD implies that this domain has a role in maintaining 
the channel’s architectural integrity. However, its functional importance 
is not entirely clear on the basis of studies of human MCU!*"* and, thus, 
warrants further investigation. 

The CCD consists of four helices—CC1, a3, CC2 and a4 (Fig. 1d, 
Extended Data Fig. 6). The first coiled-coil (CC1) forms an exception- 
ally long a-helix that curves with respect to the central axis as it extends 
from the NTD towards the TMD, transitioning seamlessly into TM1. A 
juxtamembrane loop (JML)—approximately 10 residues—and the a3 
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Fig. 2 | Channel assembly and subunit interfaces at the NTD. a, Top 
view of surface-rendered Nf{MCU showing inter-subunit interactions at 
the NTD. b, Detailed view of atomic interactions at interface I in NPMCU 
consisting of hydrogen bonds (dashed lines) mainly between main chain 
atoms. ¢c, Atomic interactions at interface II consist of mainly side chain 
hydrogen bonds. 
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Fig. 3 | Channel assembly and subunit interfaces at the TMD. a, Left, 
surface-rendered model of Nf{MCU; right, zoomed-in view of the inter- 
subunit interactions at the TMD between TM2 of one subunit and TM1 
of a neighbouring subunit. b, Top view of Nf{MCU at the pore entrance 
(top) and exit (bottom), highlighting the unique arrangement of the 
transmembrane helices in channel assembly and the widening of the pore 
moving down the ion conduction pathway. 


helix connect the end of TM2 to the second coiled-coil (CC2) in two 
distinct conformations between two neighbouring subunits (Fig. 1d). 
In one subunit, the loop runs parallel to the membrane and the a3 helix 
runs almost parallel to CC1, while in the other subunit the loop extends 
nearly perpendicularly from the membrane and the a3 helix becomes 
more perpendicular to CC1. The CC2 helix runs nearly parallel to CC1 
to form a coiled-coil and the CCD ends with the short «4 helix that 
runs perpendicularly to the coiled-coil, jutting away from the central 
axis (Fig. 1d). 


Ion-conducting pore 

The NfMCU pore is architecturally unique as compared to classical 
two-transmembrane tetrameric cation channels such as KcsA?!, whose 
outer TM1 helix wraps around the inner pore-forming TM2 helix, 
generating an inverted teepee-shaped ion conduction pore. Pillared 
by four TM 1s, the transmembrane helices of Nf{MCU enclose a cone- 
shaped ion conduction pore with a wider opening at the luminal side 
(Figs. 3, 4a). Notably, the pore-forming TM2 makes very little contact 
with the TM1 from the same subunit, but rather interacts extensively 
with a neighbouring TM1 (Fig. 3a). Consequently, there is a wide gap 
between TM1 and TM2 within each subunit at the luminal half of the 
membrane, which we expect to be sealed by lipid. 

NfMCU also lacks the canonical re-entrant pore loop and helix that 
define the selectivity filter of tetrameric cation channels; instead, the 
filter residues constitute the first helical turn of TM2. The conserved 
acidic residues of the DIME motif, which is 36, DTME372 in Nf{MCU, 
point towards the central axis and are separated by about one helical 
turn (Fig. 4). Four D369 residues encircle the cytosolic pore entrance 
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Fig. 4 | Ion conduction pore of Nf{MCU. a, Ribbon diagram of the 
NfMCU pore domain with the ion conduction pathway rendered as grey 
mesh and one pair of the acidic filter residues (D369 and E372) shown as 
sticks. b, Pore radius along the central axis. c, Side view of the selectivity 
filter region with the front subunit removed for clarity, highlighting the 
arrangement of the DIME motif and the approximate cross distance 
measurements between D369 residues and Ca** coordination distances 
for E372 (shown as dashed lines). Green sphere represents Ca**. d, Top 
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of NfMCU with a diagonal distance of about 8.0 A between carboxy- 
late oxygen atoms. The side chains of E372 form the narrowest part of 
the filter, with the narrowest opposing carboxylate oxygens measur- 
ing about 4.5 A (Fig. 4b). As N{MCU was maintained in 1mM CaCl, 
throughout sample preparation, we also observed a strong electron 
density at the centre of the E372 ring, which we assigned as a Ca”* che- 
lated by two opposing E372s in a bidentate manner with ion-ligand dis- 
tances between 2.2 and 2.5 A, which are optimal for Ca** coordination 
(Fig. 4c, d). The protein packing around the filter region also provides 
insights into the evolutionary conservation of several residues close to 
the DIME motif. W368 intercalates between TM2 of each subunit with 
its indole ring oriented nearly vertically; its phenyl ring forms CH-1 
interactions with the highly conserved P373 in a stacked-like configu- 
ration’*”?, while the amino group of the pyrrole ring forms intra- and 
intermolecular hydrogen bonds with the carboxylate oxygen of E372 
(Fig. 4c, d). Replacement of W368 or P373 with alanine resulted in 
non-functional channels (Fig. 4e). Thus, the region constituting the 
selectivity filter of NFMCU is tightly packed and conformationally rigid. 

Beyond the selectivity filter, the pore opens into a large, mainly 
hydrophobic cavity that widens towards the base of the ion conduction 
pathway rather than constricting (as seen among canonical tetrameric 
cation channels) (Fig. 4a). Whether gating occurs within the ion- 
conducting pore is unclear and warrants further investigation. 
However, our in vitro °Ca?* flux assay and bacterial Ca’* uptake assay 
seem to suggest that NfMCU functions as a constitutively open channel, 
as the only condition required to stimulate Ca** uptake is an electro- 
chemical gradient. Our speculation is also supported by the general 
consensus that Ca?* activation of the human uniporter is mediated by 
MICU] and MICU2 rather the channel itself**"’. 


Ca*+ permeation and inhibitor binding 

The proximity of the two rings of acidic residues—D369s and E372s— 
within the selectivity filter of Nf{MCU suggests a two-site, single-file 
pore for selective Ca** permeation in MCU, which is conceptually sim- 
ilar to the classical ion conduction model for the voltage-gated Ca?* 
channel (Cay)”*”®. With a larger diameter, the ring of D369s at the cyto- 
solic entrance encircles a low-affinity Ca’* site (site 1) for hydrated ion. 
The site is likely to be non-specific, but to show a preference for Ca7* 
given the highly negative charge of the ring. The narrow ring consisting 
of E372s forms the high-affinity, Ca**-selective site (site 2) with the 
acidic side chains directly chelating Ca”*, analogous to the high-affinity 
Ca?* site formed by the EEEE locus of Cay channels””. In addition, 
the structural constraint of E372 forces the carboxylate side chains to 
be near each other, making an empty site 2 energetically unfavoura- 
ble owing to electrostatic repulsion. We suspect that, to mitigate this 
effect, site 2 is constitutively bound to Ca** with high affinity, which 
can be accomplished under physiological conditions because MCU has 
high affinity for Ca?+ (<2nM)*. Thus, a potential mechanism for Ca** 


Relative Ca?* uptake rate 


view of the selectivity filter region. E372 forms a hydrogen bond network 
with W368. Ca?+ is coordinated by a pair of E372s in a bidentate manner. 
Dashed circle marks the inter-subunit packing between W368 and P373. 
e, Functional activity of NEMCU W368 and P373 mutants expressed 

in Escherichia coli. Data shown as mean + s.e.m. (n = 3 independent 
experiments). Protein expression level was analysed by immunoblotting 
using E. coli RNA polymerase alpha (RpoA) as control (see Methods). The 
experiment was repeated three times independently with similar results. 
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Fig. 5 | Mechanism of ion transport and Ru360 inhibition. a, Proposed 
model of calcium permeation in Nf{MCU. b, Concentration-dependent 
calcium uptake in E. coli expressing Nf{MCU filter mutations. Data are 
normalized to the calcium uptake rate of wild-type NfMCU at 90 1M 
CaCl. c, Normalized concentration-dependent Ru360 inhibition of E. coli 
expressing Nf{MCU D369N. d, Representative calcium uptake trace for 
E. coli expressing NE(MCU D369E mutant with or without Ru360. Arrow 
indicates addition of 501M CaCl, to the reaction. Bar graph shows the 
rate of calcium uptake normalized to the uptake rate without Ru360 in 
the reaction. Data in b, c shown as mean + s.e.m. (n =3 independent 
experiments). 


transport in MCU is likely to follow a similar model to that proposed 
for Cay channels (Fig. 5a): Ca?* binds constitutively to site 2 under 
physiological conditions and blocks the permeation of Na‘ or K*. With 
increasing cytosolic Ca”* concentration, Ca** gradually occupies site 
1 and, thereby, provide the necessary electrostatic repulsion between 
sites 1 and 2 to lower the affinity of the site 2 Ca”*, allowing it to be 
readily knocked off and to move into the water-filled vestibule directly 
below the filter. This Ca? permeation cycle repeats as the Ca?* from 
site 1 hops to site 2, freeing site 1 to bind a new Ca*t. The presence of 
the low-affinity site external to the high-affinity site may also explain 
the inward rectification of Ca** permeation observed in MCU. 

The two-site model implies that the high affinity site 2 is essential for 
Ca** selectivity while the low affinity site 1 is less specific but necessary 
for efficient Ca** permeation. We tested the roles of the acidic residues 
at both sites for Ca** permeation in Nf{MCU using our bacterial uptake 
assay. We first examined the effect of neutralizing the two acidic resi- 
dues individually (D369N and E372Q). Consistent with observations 
in human MCU?*"*, the E372Q mutant no longer conducts Ca*t, and 
the D369N mutant exhibited a slight reduction in Ca”* uptake (Fig. 5b, 
Extended Data Fig. 8). We also investigated whether disruption of the 
ion coordination distance had any effect on channel function. Notably, 
a D369E mutant showed about 50% greater Ca”* uptake compared to 
the wild type, whereas the E372D mutant remained nonfunctional. A 
similar D261E mutation in human MCU—equivalent to the D369E 
mutation in NfMCU—showed a slight reduction in calcium uptake’®. 
Mutation of D369 to threonine or glutamine yielded nonfunctional 
channels. These data confirm that site 2 (E372), which is essential for 
specific Ca’* binding, is highly sensitive to mutations, whereas the 
non-specific site 1 (D369) is more tolerant to mutations. 

The structure of Nf{MCU also offers an opportunity to probe the 
mechanism of binding of Ru360, an analogue of ruthenium red that spe- 
cifically blocks Ca”* uptake by MCU. In human MCU, a serine (S259) 
two residues before the DIME motif was initially shown to be impor- 
tant for Ru360 binding’, as an alanine mutation at this site conferred 
resistance to the drug while still allowing calcium transport. In Nf{MCU, 
the equivalent residue is a glycine. Studies on CMCU-ANTD identified 
the aspartate of the DIME motif as a second key residue for Ru360 
binding*’, which we suspect is the primary Ru360 binding site for 
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NfMCU because the DIME motif is conserved across all MCU ortho- 
logues. Neutralizing the aspartate (D369N) weakened Ru360 blocking, 
whereas maintaining the charge but reducing the pore diameter at site 1 
(D369E) eliminated Ru360 blocking (Fig. 5c, d). This result confirms 
that D369 in N{MCU forms the primary blocking site for Ru360, whose 
binding is dictated by the charge as well as the pore size at site 1. 


Conclusion 

Thus, the structure of Nf{MCU provides a model of an intact mito- 
chondrial calcium uniporter determined to near-atomic resolution, 
providing new insights into channel assembly and function. Contrary 
to the structure of CMCU-ANTD, NfMCU functions as a tetrameric 
calcium channel, whose pore is poised for rapid ion conduction. In 
essence, the channel provides a rather finite barrier measuring about 
6 A—the distance separating the filter aspartate and glutamate—that 
Ca?* must traverse from the intermembrane space to the mitochondrial 
matrix. The combination of two ion binding sites and a huge electro- 
chemical gradient across the inner mitochondrial membrane provide 
the necessary driving force for MCU to selectively conduct Ca**. It is 
notable that early studies of calcium transport demonstrated that mito- 
chondria from diverse eukaryotes have uniporter activity''. However, 
the kinetics and apparent binding affinities of calcium uptake by fungal 
mitochondria are appreciably different from those of vertebrate mito- 
chondria'!. While Nf{MCU reveals the architectural arrangement of 
the basic channel component of the uniporter, further study is needed 
to address fundamental questions about metazoan uniporter function, 
such as how the metazoan MCU evolved to require EMRE for ion con- 
duction and how MICU1 and MICU2 confer Ca”*-dependent gating 
to the uniplex. 


Online content 

Any Methods, including any statements of data availability and Nature Research 
reporting summaries, along with any additional references and Source Data files, 
are available in the online version of the paper at https://doi.org/10.1038/s41586- 
018-0333-6. 
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METHODS 

Expression and purification of NEMCU. The construct for Nf&MCU (NCBI acces- 
sion number: XP_001266985.1) was codon optimized for expression in E. coli and 
cloned into the pCOLD II vector*”. For functional assay, the full-length protein 
with only the mitochondrial targeting sequence removed (Extended Data Fig. 5) 
was modified to contain an N-terminal 6x-His tag and a thrombin cleavage site 
(LVPRGS) and then reinserted back into the pCOLD II vector (Takara) using the 
restriction sites NdeI/Xhol. For structural studies, the Nf{MCU construct used in 
the function assay was further truncated by an additional 10 residues each from the 
Nand C termini and then cloned into the pCOLD II vector using the same restric- 
tion sites. Although no significant functional difference was observed between 
the full-length protein and the truncated channel (Extended Data Fig. 1b), the 
truncated construct yields a significantly more monodispersed profile on gel filtra- 
tion and was subsequently used for structural studies (Extended Data Fig. 1a). All 
single-site mutants were generated using the Quikchange Site-Directed 
Mutagenesis Kit (Agilent) and confirmed by DNA sequencing. 

E. coli (BL21 DE3) were freshly transformed, plated onto LB/ampicillin 
(LB/Amp) agar, and grown overnight. Colonies were harvested directly from 
LB/Amp plates to seed LB media (100 j1g/mL ampicillin) and grown in shaker 
flasks at 37°C in an orbital shaker until the cells reached an OD¢9 = 0.37-0.4, at 
which point they were transferred to another orbital shaker pre-chilled to 15°C 
and allowed to grow for another hour before protein expression was induced by 
addition of IPTG to a final concentration of 0.4mM. Cells were then grown at 
15°C for 16-18h before being harvested. The cell pellets were flash frozen in liquid 
nitrogen and stored at —80°C. 

To purify Nf{MCU, the cell pellets were resuspended in lysis buffer (50 mM Tris 
pH 8, 300 mM NaCl, 1 mM CaCl,), supplemented with protease inhibitors (1 ug/mL 
each of DNase I, pepstatin, leupeptin and aprotinin, and 1mM PMSF), and then 
homogenized on ice. To remove inclusion bodies and misfolded protein, the 
homogenate was centrifuged at 10,000 x g for 20 min at 4°C. The supernatant was 
harvested and NfMCU was extracted with 1% (w:v) n-dodecyl-6-D-maltopyra- 
noside (DDM, Anatrace) by gentle agitation for 1h on ice. After extraction, the 
supernatant was collected following a 30-min centrifugation at 21,000 x g at 4°C 
and then incubated with Ni-NTA resin (Qiagen) with gentle agitation on ice for 
1h. The resin was collected on a disposable gravity column (Bio-Rad), washed with 
wash buffer A (lysis buffer+ 1 mM DDM) for 3 column volumes and then with 
wash buffer B (lysis buffer+ 1mM DDM +30 mM imidazole) for 10 column vol- 
umes. NfMCU was eluted with wash buffer C (lysis buffer + 1mM DDM + 300mM 
imidazole). Thrombin was added at a ratio of 1 unit of thrombin per 1L culture 
of bacterial cell pellet; the sample was allowed to incubate at 20°C, overnight, to 
remove the His-tag. Nf{MCU was then concentrated and further purified by size 
exclusion chromatography on a Superdex 200 10/300 GL column (GE Heathcare) 
pre-equilibrated with buffer D (20 mM HEPES pH 7.5, 300mM NaCl, 1mM CaCh, 
1mM DD). The main peak (eluting around 11 mL) was then collected for further 
biochemical and structural analysis. 

Reconstitution of Nf{MCU into saposin. We were unsuccessful in reconstituting 
NfMCU following the original protocol”, likely due to the recommended step of 
heat shocking the sample during reconstitution. We thus made some modifications 
to the protocol, which we believe is better suited for more sensitive or less stable 
membrane proteins. Our reaction consists of the following components mixed at 
a molar ratio of 1:3:10 (N{MCU: saposin: E. coli total lipids). As a substitution for 
the recommended heat-shock step”, saposin was destabilized by incubation with 
sodium cholate (15 mM final concentration) with rotation at 4°C for 1h. Purified 
NfMCU (concentrated to 100|1M) was then added to the reaction and allowed to 
rotate at 4°C for 1h before addition of BioBeads (BioRad) to remove detergents. 
The reaction was incubated with BioBeads for 3-4h at 4°C, and BioBeads exchange 
was performed a total of 3 times with each incubation step lasting 3-4h at 4°C. 
The reconstituted Nf{MCU saposin complex was separated from empty saposin 
complex and aggregates by gel filtration chromatography on a Superdex 200 10/300 
GL column (GE Heathcare) pre-equilibrated with buffer E (20 mM HEPES pH 7.5, 
300mM NaCl, 1mM CaCh, 2% glycerol). The main peak (eluting around 11 mL) 
was then collected for structural analysis. 

EM data acquisition. Prior to freezing grids, the Nf&MCU saposin complex (con- 
centrated to 0.6 mg/mL) was incubated with 501M Ru360 on ice for 30 min. The 
cryo-EM grids were prepared by applying 3 1L of the Nf&MCU-saposin-Ru360 
complex to a glow-discharged Quantifoil R1.2/1.3 300-mesh gold holey carbon 
grid (Quantifoil, Micro Tools GmbH, Germany) and blotted for 4.0s under 100% 
humidity at 4°C before being plunged into liquid ethane using a Mark IV Vitrobot 
(FEI). Micrographs were acquired on a Titan Krios microscope (FEI) operated 
at 300kV with a K2 Summit direct electron detector (Gatan), using a slit width 
of 20eV on a GIF-Quantum energy filter as well as a Volta phase plate following 
the same approach for phase plate data collection as previously described**. EPU 
software (FEI) was used for automated data collection following standard FEI 
procedure. A calibrated magnification of 59,523 x was used for imaging, yielding 
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a pixel size of 0.84 A on images. The defocus was set at —0.5 1m. Each micrograph 
was dose-fractionated to 30 frames under a dose rate of 4e7 /pixel/s, with a total 
exposure time of 12s, resulting in a total dose of about 60 e /A2. 

Image processing. For the NfMCU phase plate data set, motion correction was 
performed using the MotionCorr2 program*™, and the CTF parameters of the 
micrographs were estimated using the GCTF program*. All other steps of image 
processing were performed using RELION*. Initially, about 1,000 particles were 
manually picked from a few micrographs. Class averages representing projections 
of Nf{MCU in different orientations were selected from the 2D classification of the 
manually picked particles, and used as templates for automated particle picking 
from the full data set of 2,470 micrographs. The 526,554 extracted particles were 
binned 3 times and subjected to two rounds of 2D classification, and a total of 
355,929 particles were finally selected for 3D classification using the initial model 
generated by RELION as the reference. Two of the 3D classes showed good 
secondary structural features and their particles were selected, combined and 
re-extracted into the original pixel size of 0.84 A. After 3D refinement with C2 
symmetry imposed and particle polishing, the resulting 3D reconstructions from 
128,231 particles yielded an EM-map with a resolution of 3.8 A. The transmem- 
brane domain showed poor density, indicating a local structural heterogeneity. 
Therefore, we performed a focused 3D classification with density subtraction in 
order to improve the density of the transmembrane domain’”. In this approach, the 
density corresponding to the soluble domains of the channel as well as the belt-like 
density from saposin was subtracted from the original particles. The subsequent 
3D classification on the modified particles was carried out by applying a mask 
around the transmembrane domain and having all the orientations fixed at the 
value determined in the initial 3D refinement. After this round of classification, 
one class (83,343 particles) with better density in the transmembrane domain was 
selected for 3D refinement, yielding an EM map of 3.8 A in which the entire trans- 
membrane domain can be modelled. All resolutions were estimated by applying a 
soft mask around the protein density and the gold-standard FSC = 0.143 criterion. 
ResMap** was used to calculate the local resolution map. 

Model building, refinement, and validation. De novo atomic model building 
was conducted in Coot. Since the crystal structure of the NTD of human MCU" 
(PDB: 4XTB) fits nicely in the EM map, it was used as a guide to build the NTD of 
NfMCU. Amino acid assignment was achieved based on the clearly defined density 
for bulky residues (Phe, Trp, Tyr, and Arg). Models were refined against summed 
maps using phenix.real_space_refine*®, with secondary structure restraints applied. 
The model was validated using previously described methods to avoid overfit- 
ting”. The 3.8 A EM density map of NfMCU allowed us to construct a model 
containing the following residues: chain A, B: 124-200, 258-399, and 406-461; 
chain C, D: 124-196, 259-396, and 404-461. Although we added Ru360 in our 
sample with the hope of seeing how it blocks the channel pore, its density was not 
visible in the structure. One main reason could be the symmetry mismatch. As the 
MCU pore entrance has fourfold symmetry, Ru360 could bind to the pore in four 
different orientations, causing its density to be averaged out. The statistics for the 
model's geometries were generated using MolProbity**. Pore radii were calculated 
using the HOLE program“. All the figures were prepared in PyMol* or Chimera*®. 
Bacterial calcium uptake assay. E. coli expressing Nf{MCU were grown as 
described above with the exception that protein induction was allowed to proceed 
for 16h, at which time the cells were harvested, centrifuged at 4000 x g for 10 min 
at 4°C, and the media was discarded. To remove residual media, the cell pellet was 
resuspended in ice-cold buffer (50 mM HEPES pH 7.4, 150mM KC]) and then 
centrifuged again (4000 x g for 10 min at 4°C) and the supernatant was discarded. 
To prepare the bacteria for calcium uptake measurements, the bacteria were treated 
as though they were mitochondria and prepared as previously described for the 
purification of mitochondria from yeast’” but with slight modifications. To remove 
residual Ca", the cells were resuspended to a final concentration of 10 mg/mL in 
ice-cold bacteria prep buffer (0.6 M D-mannitol, 10 mM KH2PO,, 20mM HEPES 
pH 7.4, 5mg/mL bovine serum albumin) plus 1mM EDTA and allowed to shake 
gently on ice for 30 min. To replenish any Mg”* that was chelated by EDTA treat- 
ment, 2mM MgCl) was then added into the cell resuspension followed by gentle 
shaking on ice for another 30 min. The cells were then centrifuged (4000 x g for 
10 min at 4°C) and the supernatant was discarded. The cells were centrifuged again 
(4000 x g for 3 min at 4°C) and residual buffer was removed with a pipette. The wet 
weight of the bacteria was measured with a digital balance (typical yield is ~80 mg 
of bacteria per 30 mL culture of bacteria). The cells were then resuspended in 
ice-cold reaction buffer (0.6 M D-mannitol, 20 mM HEPES pH 6.8, 10j.M EGTA, 
0.5 mg/mL bovine serum albumin) to a final concentration of 10 mg/mL and were 
used immediately for calcium uptake measurement. 

To measure calcium uptake, we relied on the highly negative membrane poten- 
tial across the plasma membrane of E. coli** and used the fluorescent calcium 
indicator Oregon Green BAPTA-6F (Life Technologies), which was added to a 
final concentration of 11M (from aliquots of a 500\1M stock kept at —20°C) to 
cells resuspended in reaction buffer, to monitor the Ca?* concentration changes 
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over time using fluorescence signal—a conventional approach for monitoring 
mitochondrial calcium uptake in the field*>. For calcium uptake measurements, 
100 1L of cell suspension was added to a 96-well plate (Corning) and fluorescence 
was monitored in a Molecular Devices SpectraMax M3 plate reader (excitation/ 
emission: 485 nm/535 nm). Baseline fluorescence was measured for all samples at 
the start of each reaction for 60s followed by addition of 5\1L CaCl, stock to the 
100L reaction in any experiment to reach the desired concentration. With the 
exception of the experiment in Fig. 1A, where the bacteria were pulsed 3 times 
with 20|1M CaCh, all measurements were from samples pulsed only once with the 
concentration of CaCl, indicated in the figure legend. Fluorescence was monitored 
for 4min to assess calcium uptake. 

For the inhibition experiments (either with GdCl; or Ru360), the inhibitor was 
added to 1000,1L of cell suspension to the highest desired concentration of inhib- 
itor for the experiment and this cell suspension was then used as a stock to dilute 
into a cell suspension without any inhibitor added (to a final volume of 100 1L) 
to obtain a series of cells in the desired inhibitor concentration. The samples were 
then pulsed with 50 |1M CaCl, and fluorescence was monitored for 4 min. 

To ensure protein expression was comparable among the Nf{MCU mutants 
tested in our uptake assay, western blot analysis was performed on each sample 
used for each experiment. Extended Data Fig. 8 shows representative western blots 
of an experimental trial using the NE(MCU mutants referred to in this study. Mouse 
anti-His antibodies (Qiagen) was used to detect Nf{MCU expression and mouse 
anti-E. coli RNA polymerase alpha or anti-RpoA (Santa Cruz Biotechnology) was 
used as a loading control. HRP-conjugated sheep anti-mouse antibodies were used 
as the secondary antibodies (GE Healthcare). 

All data were analysed in GraphPad Prism 7 (GraphPad Software, Inc.). To 
obtain the rate of calcium uptake, the linear phase of the uptake measurement—the 
first 45-60 s of the reaction following CaCl, administration—was determined by 
fitting the data to a linear regression equation in GraphPad Prism; the slope of the 
fit is taken as the rate of calcium uptake. Further analysis such as normalization are 
indicated in the figure legend. To obtain the ICsp for Ru360, the relative calcium 
uptake rates were fitted to a dose-response curve in GraphPad Prism. 
Radioactive “Ca flux assay. In light of the contradiction of the oligomeric state 
between Nf{MCU and cMCU-ANTD, we designed an in vitro functional assay by 
reconstituting our purified tetrameric channel into liposomes. Unable to obtain 
currents of Nf{MCU by electrophysiology—probably due to the channel’s small 
conductance—we performed a radioactive Ca”* flux assay on proteoliposomes 
loaded with 150mM KCl. 

NfMCU was reconstituted into lipid vesicles using a similar protocol as 
described”. E. coli total lipid (Avanti) was prepared following standard protocol 
and then solubilized in dialysis buffer (50 mM HEPES pH 7.4, 150mM KCl) sup- 
plemented with 40 mM n-decyl-$-D-maltopyranoside (DM, Anatrace). Purified 
NfMCU in DDM (at ~1 mg/mL) was mixed with the DM-solubilized E. coli total 
lipid at a ratio of 1:100 (w:w, protein:lipids). NEMCU and lipid mixture were incu- 
bated with gentle rotation at 4°C for 1h and then dialysed (using dialysis tubing 
with a molecular weight cutoff of 12,000-14,000 Da (Spectra/Por) against 1 L of 
dialysis buffer at 4°C to remove detergent. Fresh dialysis buffer (1 L each time) was 
exchanged every 18-20h for a total of 3 exchanges. Following dialysis, NIMCU 
proteoliposomes were divided into 100 1L aliquots, flash frozen in liquid nitrogen, 
and stored at —80°C. For all flux assays, three separate batches of bacteria were 
grown to express Nf{MCU. The protein was then purified individually and used 
immediately for reconstitution into liposomes. 

The Ca?" flux assay was performed following the same procedures as previ- 
ously described but with slight modifications’. NfMCU proteoliposomes were 
thawed and sonicated in a bath sonicator for 60s at 15s intervals before the assay. 
100 |1L samples were passed through a pre-spun Sephadex G-50 fine gel filtration 
column (1.5mL bed volume in a 5 mL disposable spin column) swollen in flux 
buffer (50mM HEPES pH 7.4, 150mM KCl or NMDG). The resulting 160 LL 
of proteoliposome eluent was collected and added to 300 LL of reaction buffer 


(flux buffer + 0.2 1M valinomycin + 83 1M *5CaCl,) to initiate the reaction. At 
various time points, 75\1L of this reaction mixture was passed through a pre- 
spun Sephadex G-50 fine gel filtration column as previously described to remove 
extra-liposomal *Ca** and stop the flux. The eluent was mixed with 10 mL scin- 
tillation cocktail and its radioactivity was measured in a scintillation counter. 

For the Ru360 inhibition experiments, NfMCU proteoliposomes in flux buffer 

(50mM HEPES pH 7.4, 150 mM NMDG) were mixed with reaction buffer contain- 
ing varying concentrations of Ru360 (dissolved in NMDG flux buffer + 0.2 |1M val- 
inomycin + 831M *°CaCl). The reaction was allowed to proceed for 30 min, after 
which the reaction was stopped by passing 751L of the reaction through a pre-spun 
Sephadex G-50 fine gel filtration column as previously described. To determine 
the relative calcium uptake by N{MCU proteoliposomes, the scintillation count 
for each sample was normalized to the sample without any Ru360 in the reaction. 
Reporting summary. Further information on experimental design is available in 
the Nature Research Reporting Summary linked to this paper. 
Data availability. The cryo-EM density maps of the NfMCU have been deposited 
in the Electron Microscopy Data Bank under accession numbers EMD-7826 (with- 
out saposin) and EMD-7828 (with saposin, low-pass filtered to 5.0 A resolution). 
Atomic coordinates have been deposited in the Protein Data Bank under accession 
numbers 6D7W (without saposin) and 6D80 (with saposin modelled as poly-A 
peptide). Source data for Figs. 4e, 5b-d, Extended Data Figs. 1b, 2 are available 
in the online version of the paper. The gel source data for Fig. 4e, Extended Data 
Figs. 1b, c, 8 are available in Supplementary Fig. 1. 
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Extended Data Fig. 1 | Biochemical analysis of NfMCU. a, Representative 
gel filtration profile of full-length (black trace) versus truncated NEMCU 
used for cryo-EM studies (blue trace). b, Functional activity of full-length 
(FL) and truncated Nf{MCU (EM); data shown represent mean + s.e.m. 
(n=3 independent experiments). Cell lysates were analysed by 
immunoblotting using anti-His antibody to detect expression of NEMCU; 
RpoA was used as loading control. c, Detergent-solubilized Nf{MCU 
(control) separates as a single band at ~46 kDa on Coomassie-stained 


SDS-PAGE and crosslinks as tetramers when treated with chemical 
crosslinkers of varying spacer arm length. DSG, disuccinimidyl glutarate; 
DSS, disuccinimidyl suberate; BS3, bis(sulfosuccinimidyl) suberate; 
BSPEGs, PEGylated bis(sulfosuccinimidyl)suberate. See Supplementary 
Fig. 1 for gel source data. The experiments shown in a and b were repeated 
at least three times independently with similar results. The crosslinking 
experiment shown in c was performed once. 
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Extended Data Fig. 2 | Functional characterization of Nf{MCU. 

a, Calcium uptake in E. coli expressing empty vector (black trace) or 
NfMCU (blue trace), demonstrating that heterologously expressed 
NfMCU can confer uniporter function to bacteria. Arrows indicate 
addition of 201M CaCl, to the reaction solution. b, Concentration- 
dependent calcium uptake in E. coli expressing empty vector (black trace) 
or NfMCU (blue trace). Each point represents the rate of calcium uptake 
normalized to the uptake rate at 90 1M CaCl. c, Concentration-dependent 
inhibition of Nf{MCU by GdCl; in E. coli expressing NfMCU. Each point 
represents the rate of calcium uptake normalized to the uptake rate 
without GdCl, in the reaction. d, Concentration-dependent inhibition of 
Ca?* uptake in E. coli expressing Nf{MCU by the MCU-specific inhibitor 
Ru360. Each point represents the rate of calcium uptake normalized to 

the uptake rate without Ru360 in the reaction. The Ru360 concentration 
required to reduce the Ca?* uptake rate by half (ICs59) is calculated to be 
0.5 1M. e, In light of the contradiction of the oligomeric state between 
NfMCU and cMCU-ANTD, we also designed an in vitro functional assay 
by reconstituting our purified tetrameric channel into liposomes. Unable 
to obtain currents of N{MCU by electrophysiology—probably owing to the 


channel’s small conductance—we performed a radioactive Ca flux assay 
on proteoliposomes loaded with 150 mM KCl. Shown here is the time- 
dependent *°Ca?* uptake by empty liposomes or Nf{MCU proteoliposomes 
incubated with Ca’* in a reaction buffer containing 150 mM KCl (red 
points) or 150 mM NMDG (blue points). The result demonstrates that 

in a reaction solution containing 150 mM NMDG and the K*-selective 
ionophore valinomycin (to generate an electrical driving force), NfMCU 
proteoliposomes exhibited time-dependent “°Ca?* uptake while the 

same proteoliposomes in a reaction solution containing valinomycin and 
150mM KCI (to eliminate electrical driving force) accumulated *Ca?* 
only slightly better than empty liposomes, recapitulating the voltage- 
dependent Ca”* uptake property of the MCU. f, Concentration-dependent 
inhibition of Nf{MCU proteoliposomes by Ru360, demonstrating that 
Ru360 also blocks NfMCU “Ca?* uptake in a concentration-dependent 
manner up to 50%, which is expected given an equal distribution of two 
channel orientations in the liposomes. Each point represents radioactivity 
measured after 30 min reaction normalized to sample without Ru360. All 
data in a-f are shown as mean + s.e.m. (nm =3 independent experiments). 
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Extended Data Fig. 3 | Cryo-EM analysis of Nf{MCU. a, Representative section of the Methods. d, Local resolution of Nf{MCU estimated with 
phase-plate electron micrograph of the Nf{MCU saposin complex RELION2.0. e, FSC curves for the refined model versus the summed 3.8 A 
and 2,470 micrographs were used for structure determination. map (black curve), the refined model versus half map 1 (red curve), and 
b, Representative 2D class averages of the N{MCU saposin complex. the refined model versus half map 2 that was not used for refinement 
c, Flowchart of data processing to obtain the 3.8 A cryo-EM map of the (green curve). f, The gold-standard FSC curve for the cryo-EM map. 


NfMCU saposin complex. Details can be found in the Image processing 
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Extended Data Fig. 4 | EM map of Nf{MCU. a, Representative regions of (TM1 and TM2). Protomers 1 and 2 are shown in yellow and orange, 
the EM map of Nf{MCU highlighting key structural features at the NTD respectively. b, Stereo view of the EM map carved around the TM2 filter 
(al, «2, 82-83), the coiled-coil domain, and transmembrane domain region of Nf{MCU. 
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3 
Extended Data Fig. 5 | Cryo-EM structure of Nf{MCU in complex with wrap around NfMCU stabilized in lipid. b, Ribbon diagram of N{MCU 
saposin. a, Cryo-EM map of NfMCU in complex with saposin (shown in complex with saposin. Approximately 79-81 residues of the 81-residue 
alternating between purple and pink). The map is low-pass filtered to 5A saposin molecule were modelled as a poly-alanine chain into the 
to allow better visualization of the density from the saposin molecules. helix-turn-helix EM map density for saposin. 


Six saposin molecules, each oriented at a 45° angle to the vertical axis, 
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Extended Data Fig. 6 | Sequence alignment of MCU orthologues. The NfMCU (grey arrow) are predicted to be the mitochondrial targeting 
sequences were aligned using PROMALS3D™ and numbered according sequence (MTS). NCBI accession number for the sequences are: H¥MCU 
to NfMCU. Secondary structures shown above the sequences are based on (human MCU): NP_612366.1, MmMCU (mouse MCU): XP_006513531.1, 
the cryo-EM structure of Nf{MCU. Loops are denoted by solid black lines; DdMCU (D. discoideum MCU): XP_637750.1, CeMCU (C. elegans MCU): 
disordered regions not observed in the cryo-EM structure and truncations NP_500892.1, NCMCU (Neurospora crassa MCU): XP_959658.1, Nf{MCU 
made to NfMCU are indicated by dashed lines. The first 75 residues in (Neosartorya fischeri MCU): XP_001266985.1. 
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Extended Data Fig. 7 | Structural comparison between MCU domains. juxtamembrane helix; OJMH, outer juxtamembrane helix; L1, loop 1; L2, 
a, Overall structure comparison between tetrameric Nf{MCU and loop 2; JML, juxtamembrane loop; CC1, coiled-coil 1; CC2, coiled-coil 2. 
pentameric CMCU-ANTD (PDB: 51D3). Each subunit is individually c, Structural comparison between the NTDs of Nf{MCU and HsMCU 
coloured. b, Structure comparison between a single subunit of the (PDB: 4XTB). Dashed box indicates the major difference between the 
pore domain of Nf{MCU and cMCU-ANTD. Dashed boxes indicate two structures. d, Atomic interactions between two NTD subunits in the 
major differences between the two structures. Structural features in crystal structure of H3¥MCU NTD, equivalent to interface I identified in 
cMCU-ANTD are labelled as in the original structure’®. IJMH, inner the cryo-EM structure of Nf{MCU. 
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Extended Data Fig. 8 | Western blot analysis of Nf{MCU expression expression and mouse anti-RpoA targeting E. coli RNA polymerase 
in E. coli. N{MCU, with its mitochondrial targeting sequence removed alpha (Santa Cruz Biotechnology) was used for loading control. See 
and replaced with a 6 x His tag, was expressed in E. coli for functional Supplementary Fig. 1 for gel source data. The experiment was repeated 


analysis. Mouse anti-His antibodies (Qiagen) were used to detect NEIMCU three times independently with similar results. 
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Extended Data Table 1 | Cryo-EM data collection and model statistics 


Data collection and processing 
Magnification 
Voltage (kV) 
Electron exposure (e-/A’) 
Defocus range (ym) 
Pixel size (A) 
Symmetry imposed 
Initial particle images (no.) 
Final particle images (no.) 
Map resolution (A) 

FSC threshold 
Map resolution range (A) 


Refinement 
Initial model used (PDB code) 


Model resolution (A) 
FSC threshold 


Model resolution range (A) 
Map sharpening B factor (A’) 
Model composition 
Non-hydrogen atoms 
Protein residues 
Ligands 
B factors (A’) 
Protein 
Ligand 
R.m.s. deviations 
Bond lengths (A) 
Bond angles (°) 
Validation 
MolProbity score 
Clashscore 
Poor rotamers (%) 
Ramachandran plot 
Favored (%) 
Allowed (%) 
Disallowed (%) 


Nf{MCU 


(EMDB-7826) 
(PDB 6D7W) 


128,231 
3.8 

0.143 
184.8 — 3.8 


Initial model 
generated in 
RELION2.0 
3.8 

0.143 

184.8 — 3.8 
-120 


8,946 
1,087 
1 
103.55 
103.55 
127.22 


NfMCU saposin 
complex 
(EMDB-7828) 
(PDB 6D80) 


128,231 
5.0 

0.143 
184.8 — 5.0 


Initial model 
generated in 
RELION2.0 
5.0 

0.143 

184.8 — 5.0 
-120 


11,351 
1,568 


135.43 
135.43 
127.22 
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Mitochondrial calcium uptake is critical for regulating ATP production, intracellular calcium signalling, and cell death. 
This uptake is mediated by a highly selective calcium channel called the mitochondrial calcium uniporter (MCU). Here, we 
determined the structures of the pore-forming MCU proteins from two fungi by X-ray crystallography and single-particle 
cryo-electron microscopy. The stoichiometry, overall architecture, and individual subunit structure differed markedly 
from those described in the recent nuclear magnetic resonance structure of Caenorhabditis elegans MCU. We observed 
a dimer-of-dimer architecture across species and chemical environments, which was corroborated by biochemical 
experiments. Structural analyses and functional characterization uncovered the roles of key residues in the pore. These 
results reveal a new ion channel architecture, provide insights into calcium coordination, selectivity and conduction, 
and establish a structural framework for understanding the mechanism of mitochondrial calcium uniporter function. 


Mitochondrial Ca?* uptake is pivotal for fundamental cellular 
processes such as ATP production, cell death, and cytosolic Ca2+ 
signalling'~>. Rapid mitochondrial Ca?* uptake is mediated by an 
ion channel called the mitochondrial Ca’* uniporter*®. The uni- 
porter resides in the inner mitochondrial membrane, is remarkably 
selective for Ca2*, has high conductance, and is inhibited by ruthenium 
red’, During cytoplasmic Ca’” signalling, local increases in the Ca”* 
concentration activate the uniporter®”, leading to substantial Ca?t 
influx. 

Recent studies identified and established MCU as the essential 
pore-forming subunit of the Ca** conduction channel!®!?. MCU 
shows higher-order oligomerization’*4 and bears no discernible 
sequence similarity to other known channels. It forms a multi- 
component complex with regulatory subunits, including MICU, 
EMRE!}3#!22, MCUb?3, and MCUR1!%45. MCU is ubiquitous in plants 
and metazoans and present in protozoans and fungi!°”®. By contrast, 
EMRE is found only in metazoans and MICU is absent from most 
fungi*!°. Malfunction or deregulation of the uniporter complex has 
been linked to tumour progression, muscle weakness, neurological 
defects, and impairments in tissue regeneration”. 

MCU is characterized by a highly conserved signature DXXE motif, 
two transmembrane helices, and flanking coiled-coils”®. Its N-terminal 
domain (NTD) shows reliable sequence conservation only among 
metazoan species. This observation raises the question of whether 
MCU has a unified overall architecture. Recent nuclear magnetic res- 
onance (NMR) studies of a Caenorhabditis elegans MCU (cMCU) core 
domain provided a first glimpse into the structure of MCU®. These 
investigations defined a pentamer architecture with a layered domain 
organization and revealed the formation of the inner core by trans- 
membrane helix 2 (TM2) and the pore-facing location of a glutamate 
residue that is critical for Ca2* uptake’s, However, the cMCU construct 
neither contained the NTD nor conducted Ca?* by itself. Furthermore, 
without an apparent Ca”* permeation route or defined chemistry of the 
selectivity filter in the NMR structure, the mechanism of Ca** coordi- 
nation and permeation remains unclear. 


Here, we determined the structure of MCU with all structural 
domains, capable of conducting Ca**. Unexpectedly, our X-ray 
crystallography and cryo-electron microscopy (cryo-EM) studies 
revealed a dimer-of-dimer assembly of MCU. Both the overall 
architecture and the structure of each protomer were distinct from 
those of the NMR structure of cMCU, giving rise to a completely 
different pore. Our structures define the key elements of the Ca?* per- 
meation pathway and provide insights into Ca”* selectivity. 


Functional characterization and crystal structure 

We identified two MCUs from fungi, Fusarium graminearum MCU 
(FgMCU) and Metarhizium acridum MCU (MaMCU) (Extended 
Data Fig. 1a), that showed excellent biochemical behaviour (Extended 
Data Fig. 1b, c). As EMRE is absent in fungi”! and another fungal 
MCU was shown to mediate mitochondrial Ca** uptake”, these MCUs 
were potentially sufficient for basic uniporter activity. In an Escherichia 
coli-based system with the fluorescent Ca?* reporter GCaMP2”°, 
cells with wild-type MaMCU (Fig. la) or FEMCU (Extended Data 
Fig. 1d) showed robust Ca”+ uptake. This uptake was diminished by 
mutation of residues critical for Ca** conductance’™”' or addition 
of the classic uniporter inhibitor ruthenium red (Fig. la, Extended 
Data Fig. 1d). Furthermore, when reconstituted in liposomes, 
purified wild-type MaMCU (Fig. 1b) and FgMCU (Extended Data 
Fig. le) showed robust ion conduction activity in a fluorescence-based 
flux assay* | The addition of Gd**, another MCU inhibitor, abolished 
this activity (Fig. 1b, Extended Data Fig. le). Thus, MaMCU and 
FgMCU recapitulate key features of MCU and MCU is sufficient to 
form a pore. 

We obtained diffracting crystals of MaMCU either by inserting 
rubredoxin into an NTD loop or with nanobody bound (Extended Data 
Fig. 2a, b, d and Extended Data Table 1). As crystal with nanobody dif- 
fracted X-rays better (to approximately 3.1 A) and showed considerably 
better electron density in the transmembrane region, it was used for all 
structural interpretations. With 12 mercury sites introduced individu- 
ally into the transmembrane region (Extended Data Fig. 2e) and clear 
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Fig. 1 | Functional characterization and crystal structure of MCU. 

a, Ca** uptake activity of MaMCU in E. coli. Fluorescence changes upon 
addition of 0.5 mM Ca?" are shown for wild-type (red), D333A/E336A 
(green), and control (black). Ruthenium red (RuR) abolished wild-type 
uptake (blue). AU, arbitrary units. b, Cation flux activity of MaMCU in 
reconstituted liposomes. MCU efficiently conducts monovalent cations 
(for example, Na*) when divalent cations are absent®. Vesicles containing 
150 mM Na* were assayed in buffer containing 150 mM N-methyl-p- 
glucamine (NMDG). Upon addition of proton ionophore (CCCP), Na* 
efflux through MCU drives CCCP-mediated proton influx, quenching 
9-amino-6-chloro-2-methoxyacridine (ACMA) fluorescence. Liposomes 
with MaMCU showed robust fluorescence changes (red); Gd?* addition 
abolished flux (blue). Na* ionophore (monensin) was added to make 
liposomes permeable to Na*. c, MaMCU crystal structure. Ribbon 
representations of MaMCU viewed from membrane (left) or cellular side 
(right). Each domain is coloured according to subunit diagram (bottom 
right). Data in a and b are representative of three independent experiments 
with similar results. 


side chain densities (Extended Data Fig. 2f, g), we could unambiguously 
register and build the side chains. 

In the crystal, two crystallographically related dimers interact tightly, 
forming one MCU channel with a two-fold symmetry axis along the 
Ca**-conducting pathway. MCU comprises three layers: a trans- 
membrane domain, a coiled-coil, and an NTD, forming an elongated 
shape (Fig. 1c). Notably, the oligomerization state, architecture, and 
single-subunit structure (including the selectivity filter configuration, 
packing mode between transmembrane helices, and overall fold) of 
MaMCU differed from the published pentameric structure of CMCU 
(Extended Data Fig. 3a, b). 


Transmembrane domain and ion conduction pore 

The transmembrane domain has an approximately four-fold symmetry 
and the shape of a truncated pyramid (Fig. 1c). The two transmem- 
brane helices (TM1 and TM2) connect by a short loop. TM1 is located 
on the outside and TM2 lines the central ion conduction pore, con- 
sistent with the observation that TM1 of human MCU mediates the 
interaction with EMRE”. TM1 interacts loosely with TM2 of the same 
subunit and packs intimately against TM2 from a neighbouring subunit 
(Fig. 1c), producing a sense of domain swapping in the transmembrane 
connectivity. In one pair of subunits, the connection between TM2 
and the second helix of the coiled-coil is not well ordered, indicating 
flexibility. 

The pore of MCU is narrow at the top. Notably, in contrast to the 
assumption that the signature motif DXXE lies within a loop, DXXE 
spans the first helical turn of TM2 with the strictly conserved Asp333 
and Glu336 constituting the first two pore-lining residues (Fig. 2d). 
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Fig. 2 | Ca”* permeation pathway. a, Electron density map around ion 
coordination region. Weighted 2F,—F, contoured at 30 (cyan) and Ca?* 
ion (pink sphere) are shown. b, Anomalous-difference Fourier map for 
Ca?* at 4.5 A contoured at 4c (pink mesh). c, Ca?+ coordination in filter 
region viewed from top. Refined model (ribbon) and selected residues 
(sticks) are shown. d, Ca” coordination in filter region viewed within 
membrane, with front helices removed for clarity. e, Surface representation 
viewed from top, coloured according to electrostatic potential (red, 
negative; blue, positive). f, lon permeation pathway. Ion-conducting 
passage (green dots) was calculated by HOLE. Pore radius is shown on 
right. 


Asp333 is located at the intermembrane end of TM2 and is solvent 
exposed. Its side chain density was less resolved than those of other 
key residues. In the modelled configuration, the Asp333 side chain 
carboxylate does not directly point to the pore centre, potentially pro- 
viding negative charge for Ca** recruitment (Fig. 2e). Alternatively, if 
Asp333 takes the most common rotamer position, which points into 
the pore, its carboxylate ring could coordinate a hydrated Ca?*, given 
a diameter of approximately 8 A. One helical turn below Asp333 is 
Glu336, whose carboxylate group points to the pore centre (Fig. 2a, c). 
We observed a strong density along the central axis, consistent with a 
bound ion surrounded by the carboxyl oxygens of Glu336 (Fig. 2a). 
As the protein was purified and crystallized in the presence of 2mM 
Ca** and no other divalent ions, we assigned Ca?” at this location, 
which was further confirmed by a strong anomalous difference peak 
occupying the same location (Fig. 2b). The diameter of the Glu336 
carboxylate ring is approximately 5 A, too small for hydrated Ca?* to 
go through. The observed Ca”+-oxygen distance (around 2.4-2.7 A; 
Fig. 2d) is consistent with direct coordination**. We thus propose that 
Ca’* passes through a selectivity filter formed by the glutamate ring 
in a dehydrated form, similar to the direct coordination of Ca”* by 
four aspartate side chains in TRPV6°?. Between the side chains of two 
neighbouring Glu336 is the indole ring of Trp332, which stabilizes the 
closely spaced carboxy] side chains of Glu336 through hydrogen bonds 
or anion-7 interactions (Fig. 2c). Trp332 also forms a close stacking 
interaction with Pro337 that is important for orientating Glu336 for 
Ca?* coordination. The intimately interacting Trp332, Glu336, and 
Pro337 are thus essential for proper configuration of the selectivity 
filter, consistent with their strict conservation across species. 

One helical turn below Glu336 (and two below Asp333) is Tyr340. 
The aromatic side chain of Tyr340 is lined along the wall of the pore, 
forming a passage about 12 A wide. Below the tyrosine, the pore is 
relatively wide open (Fig. 2f), consistent with the high conductance of 
MCU. Near the matrix side, TM1 and TM2 of the same subunit make 
fewer direct contacts. Perhaps lipids fill in to seal the bottom of the 
channel, in line with weak extra density around this location. 


Structure of the soluble domain 

Within the mitochondrial matrix resides the soluble domain of MCU, 
comprised of an NTD, a coiled-coil, and a variable non-conserved C 
terminus. This domain is involved in MCU assembly, regulation of 
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Fig. 3 | Cryo-EM structure of FgMCU in nanodiscs. a, Slice view of 

the FgMCU 3D reconstruction (type 1 conformation) filtered to 5A 
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and map calculation were not repeated. 


channel activity, and post-translational modification'*”>*4, We deter- 
mined the crystal structure of the soluble domain of MaMCU at 3.1A 
resolution (Extended Data Fig. 3c), providing further support of its 
quaternary organization. 

Notably, the globular NTD of MaMCU has the same structural fold 
as that of human MCU (root mean square deviation (r.m.s.d.) = 1.6 A) 
despite there being no significant sequence similarity (Extended 
Data Fig. 3d). Within the crystal lattice, the soluble domain forms a 
dimer-of-dimers configuration (Extended Data Fig. 2c), matching well 
with the corresponding part of the whole MCU structure. Notably, a 
dimer interface of NITDs in MaMCU is the same as an interface of 
human NTD found in multiple crystal forms'**° (Extended Data 
Fig. 3e). These observations support a conserved architecture from 
fungi to humans. 


Single-particle electron microscopy of MCU 

We used negative-stain electron microscopy to screen FEMCU 
and MaMCU exchanged into amphipols (A8-35 and PMAL-C8) 
or reconstituted into lipid nanodiscs. The negatively stained MCU 
proteins were monodisperse and showed two elongated densities 
extending parallel from the membrane region (Extended Data Fig. 4a). 
Cryo-EM analysis was carried out on MaMCU in A8-35, FgMCU in 
PMAL-C8, and FgMCU in nanodiscs (Extended Data Figs. 4, 5). The 
2D averages showed the soluble domains organized with an approx- 
imate two-fold symmetry. Following 3D classification without sym- 
metry (Extended Data Figs. 4, 5), the 3D reconstructions consistently 
showed a dimer-of-dimers structure. 3D refinement with two-fold 
symmetry generated final maps at 5-7 A resolution (Extended Data 
Table 2). The overall architectures of all three MCU complexes were 
essentially identical. 

In the best cryo-EM 3D reconstructions using Fg MCU in nano- 
discs (Extended Data Fig. 5), all eight transmembrane helices were 
well resolved and arranged with a near four-fold symmetry (Fig. 3a). 
The resolution was likely to be limited by the mobile transmembrane 
helices and flexible connection between the soluble and transmem- 
brane domains. Focused 3D classification revealed different structures 
(Extended Data Fig. 5c). One major class represented the conforma- 
tion in the averaged cryo-EM map (conformation type 1; 19% of par- 
ticles); another generated a 3D reconstruction showing less complete 
transmembrane helices but strong central densities, presumably con- 
tributed by hydrated ions or small molecules (conformation type 2; 
21% of particles). These two classes were refined to 5 A and 4.8 A res- 
olution, respectively, for the transmembrane domain (Extended Data 
Fig. 6). The type 1 conformation superimposes well onto the structure 
model of MaMCU (Fig. 3b and Extended Data Fig. 6g). In the type 2 
conformation, the lower halves of two opposing inner transmembrane 
helices appear to bend towards the centre (Extended Data Fig. 6g), pos- 
sibly representing a more closed conformation. 
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view of MaMCU NTD. Domain organization is depicted using schematic 
drawings. b, Cellular-side view of MaMCU transmembrane domain. 
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Stoichiometry and symmetry mismatch of MCU 

To test whether our observed tetrameric organization represents 
the quaternary structure of MCU in the membrane, we performed 
site-directed disulfide bridge crosslinking. On the basis of the struc- 
ture, we introduced a pair of cysteines at the protomer interface of 
the transmembrane domain on a cysteine-free background (Extended 
Data Fig. 7a). In either detergent micelle or the isolated membrane, 
crosslinking under oxidizing conditions gave rise to a newly formed 
product with the expected size of a tetramer in a time-dependent man- 
ner (Extended Data Fig. 7b, c). Furthermore, MaMCU crosslinked 
in the membrane migrated in the same way as untreated protein on 
size-exclusion chromatography (Extended Data Fig. 7d). To evaluate 
whether the architecture of MCU is conserved in metazoan MCU, we 
made equivalent double cysteine mutations in MCU from mosquito, 
which possesses EMRE and MICU in the genome. Under oxidation 
conditions, the main cross-linked product migrated to the same 
tetramer position as that of MaMCU on SDS-PAGE (Extended Data 
Fig. 7e). Comparison of 2D averages of negatively stained mosquito 
MCU with those of FgMCU indicated a similar overall architecture 
(Extended Data Fig. 7f). 

Overall, MCU is organized as a dimer of dimers. The soluble domain 
shows two-fold symmetry, with each NTD directly contacting two 
neighbouring subunits through distinct interfaces (Fig. 4a). By con- 
trast, the transmembrane domain shows approximately four-fold rota- 
tional symmetry (Fig. 4b). This symmetry mismatch is evident from 
the slice views of the cryo-EM map (Fig. 3a) and is illustrated by mark- 
ers on each domain in the crystal structure (Extended Data Fig. 7g). 
Superimposing subunits A and C with neighbouring subunits B and 
D revealed how the symmetry mismatch might be reconciled (Fig. 4c). 
First, the coiled-coil domain swings approximately 8° around its end 
near the NTD. This movement propagates through a long distance to 
the top of the transmembrane domain (about 85 A away), resulting ina 
large displacement of the transmembrane domain. Second, the junction 
between the transmembrane domain and the coiled-coil is flexible and 
accommodates a roughly 20° bend right after the end of the coiled-coil, 
moving the transmembrane domain farther away. As a result, subunits 
A and C adopt a substantially different conformation from subunits B 
and D, even though each individual domain superimposes well. Both 
ends of the coiled-coil are likely to function as joints, providing flexi- 
bility for different relative orientations. 


Functional studies of transmembrane domain 

To validate the structural model and assess the functional role of res- 
idues in the transmembrane domain, we performed a systematic ala- 
nine scan. Consistent with the structural observations, single point 
mutations that substantially impaired uptake activity were mostly con- 
centrated on the pore-facing side of TM2, with a helical periodic distri- 
bution pattern (Fig. 5a, b). The strictly conserved Trp332, Pro337, and 
Glu336, which are essential for proper configuration of the selectivity 
filter, were indispensable for uptake. Substitution of Tyr340, which sits 
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Fig. 5 | Functional validation of transmembrane domain. a, E. coli-based 
Ca’* uptake activities of MaMCU with individual alanine (or leucine 

for alanine/glycine) point mutations in transmembrane domain in the 
presence of 50,.M Ca” (mean + s.e.m., n=5 independent experiments 


directly below the filter, substantially impaired uptake and proper pro- 
tein folding, indicating an important structural role while complicating 
specific functional interpretation. The residue on TM1 that showed 
the most pronounced effect on uptake activity was Trp317, whose side 
chain constitutes a primary contact point between TM1 and TM2. 
Mutating Phe326 or Gly331 of the TM1-TM2 linker also impaired 
Ca** uptake, probably owing to their effect on the linker conformation 
and thus the configuration of the pore entrance. 

We examined the effect of the mutations on the sensitivity of MCU 
to ruthenium red by testing residues near the intermembrane surface. 
Substitution of D333A had the most marked effect and substantially 
reduced sensitivity to ruthenium red (Extended Data Fig. 8), consist- 
ent with an equivalent mutation of the haman MCU”. Ser259 of the 
human MCU, another residue involved in ruthenium red sensitiv- 
ity, is not conserved in MaMCU or FgMCU'™"". Perhaps positively 
charged ruthenium red interacts with negatively charged Asp333 near 
the entrance of the pore and blocks the channel, competing with the 
Ca’*-binding site. This would explain why ruthenium red inhibited 
inward but not outward current of MCU*%. Together, our mutagenesis 
studies are fully compatible with the structure, providing insights into 
MCU function. 


Discussion 

Our X-ray and electron microscopy structures, together with mem- 
brane cross-linking and functional experiments, have revealed a 
conserved dimer-of-dimers architecture of MCU with physiological 
relevance. The marked difference between our MCU and the cMCU 
structures might be related to the approaches used, as the CMCU sam- 
ple preparation involved the relatively harsh detergent Fos-choline-14 
and inclusion bodies, and the oligomerization state of cMCU was 
inferred from an 18 A-resolution negative-stain electron microscopy 
reconstruction”®. 

The channel domain of MCU represents a previously undescribed 
architecture of ion channels. Compared with voltage-gated ion chan- 
nels (VGICs), MCU has a roughly inverted shape resembling a trun- 
cated pyramid. Moreover, the selectivity filter of MCU is formed by 
side chains on TM2, as opposed to a reentry loop lining the pore for 
VGICs?”-“, Nonetheless, MCU, Ca**-conducting VGICs, and the 
hexameric Orai Ca** channel share a common theme: a carboxylate 
ring at the entrance of the selectivity filter is likely to provide negative 
charge to effectively coordinate Ca**. This convergent configuration 
indicates a fundamental requirement for Ca”* channels that is achieved 
by different means. Compared to Orai, the pore of MCU is mostly 
hydrophilic and wider open, which may contribute to its relatively high 
conductance. 

MCU has high selectivity with a high-affinity binding site to Ca?* 
in the pore®, which is essential for its physiological function, given the 
relatively low cytosolic Ca’* concentration. We propose that the high 
selectivity of Ca** is achieved by high-affinity binding through direct 
coordination by the carboxylate oxygen of the Glu336 side chains. The 
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except for mutants on residues 321-335, n= 4 independent experiments). 
b, Function-impairing residues. Transmembrane helices (grey) and 
residues whose substitution substantially affected Ca”* uptake (below 
arbitrary 0.4 cut-off; yellow sticks) are shown. 


conserved interaction network among Trp332, Glu336, and Pro337 
stabilizes closely spaced carboxylate groups of the Glu336 ring, which 
might contribute to the high-affinity binding of Ca**. Asp333 and 
Glu336 of MCU showed a notable parallel with the engineered CayAb 
Ca** channel, in which two adjacent rings of negatively charged 
residues also line the mouth of the pore and are important for Ca?* 
selectivity’. We observed only one strong Ca”* binding site in the 
crystal structure. It remains to be investigated whether Asp333 is also 
involved in coordinating a secondary Ca’ for a potential knock-off 
mechanism*, 

Post-translational modifications or divalent cation binding to 
the NTD can modulate Ca?* conduction by MCU!*?>#4. In this 
regard, it is notable that the symmetry mismatch between the 
transmembrane domain and the soluble NTD is similar to that of 
ionotropic glutamate receptors (iGluRs)*°. During the gating cycle of 
iGluRs, the ligand-binding domain undergoes substantial re-organi- 
zation and transitions between two-fold and four-fold symmetry“. By 
analogy, a similar reorganization in the NTD could modulate MCU 
channel activity by switching subunits A and C or B and D between 
alternate conformations, thereby altering the packing or conforma- 
tion of the transmembrane helices. In the crystal structure of MaMCU, 
immediately below the selectivity filter, Tyr340 interacts with residues 
on the neighbouring subunit, orienting its side chain along the pore. 
Notably, the cryo-EM map of FEMCU shows density in the pore that 
could potentially result from one pair of Tyr340 side chains pointing 
into the pore. Although higher resolution is needed for a definitive 
assignment, an intriguing possibility is that altered transmembrane 
packing may weaken the interactions of Tyr340 with nearby transmem- 
brane helices to facilitate a rotamer switch of one pair of tyrosines to 
control Ca’* flow through the pore. Alternatively, the rearrangement 
of NTDs could potentially lead to bending of transmembrane helices 
to constrict the pore, as suggested by two possible conformations in 
the cryo-EM map. 

In summary, our structures and biochemical studies establish the 
stoichiometry and architecture of MCU and provide important insights 
into Ca?* coordination and permeation. In the future, structures at 
higher resolutions, in various states, and in complex with regulatory 
components will be needed for a detailed mechanistic understanding 
of Ca?* selectivity and channel gating. 


Online content 

Any Methods, including any statements of data availability and Nature Research 
reporting summaries, along with any additional references and Source Data files, 
are available in the online version of the paper at https://doi.org/10.1038/s41586- 
018-0330-9. 


Received: 7 January 2018; Accepted: 17 May 2018; 
Published online 11 July 2018. 


1. Gunter, T. E. & Pfeiffer, D. R. Mechanisms by which mitochondria transport 
calcium. Am. J. Physiol. 258, C755-C786 (1990). 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


10. 


11. 


12. 


13. 


14. 


15. 


16. 


17. 


18. 
19. 


20. 


21. 


22. 
23. 


24. 


25. 


26. 


27. 


28. 


29. 


30. 


Rizzuto, R., De Stefani, D., Raffaello, A. & Mammucari, C. Mitochondria as 
sensors and regulators of calcium signalling. Nat. Rev. Mol. Cell Biol. 13, 
566-578 (2012). 

Szabadkai, G. & Duchen, M. R. Mitochondria: the hub of cellular Ca2* signaling. 
Physiology (Bethesda) 23, 84-94 (2008). 

Deluca, H. F. & Engstrom, G. W. Calcium uptake by rat kidney mitochondria. 
Proc. Natl Acad. Sci. USA 47, 1744-1750 (1961). 

Vasington, F. D. & Murphy, J. V.Ca ion uptake by rat kidney mitochondria 

and its dependence on respiration and phosphorylation. J. Biol. Chem. 237, 
2670-2677 (1962). 
Kirichok, Y., Krapivinsky, G. & Clapham, D. E. The mitochondrial calcium 
uniporter is a highly selective ion channel. Nature 427, 360-364 (2004). 

De Stefani, D., Rizzuto, R. & Pozzan, T. Enjoy the trip: calcium in mitochondria 
back and forth. Annu. Rev. Biochem. 85, 161-192 (2016). 

Rizzuto, R., Brini, M., Murgia, M. & Pozzan, T. Microdomains with high Ca?* close 
to IP3-sensitive channels that are sensed by neighboring mitochondria. Science 
262, 744-747 (1993). 
Giacomello, M. et al. Ca** hot spots on the mitochondrial surface are generated 
by Ca2+ mobilization from stores, but not by activation of store-operated Ca?+ 
channels. Mol. Cell 38, 280-290 (2010). 
Baughman, J. M. et al. Integrative genomics identifies MCU as an essential 
component of the mitochondrial calcium uniporter. Nature 476, 341-345 
(2011). 

De Stefani, D., Raffaello, A., Teardo, E., Szabd, |. & Rizzuto, R. A forty-kilodalton 
protein of the inner membrane is the mitochondrial calcium uniporter. Nature 
476, 336-340 (2011). 

Chaudhuri, D., Sancak, Y., Mootha, V. K. & Clapham, D. E. MCU encodes the pore 
conducting mitochondrial calcium currents. eLife 2,e00704 (2013). 
Kovacs-Bogdan, E. et al. Reconstitution of the mitochondrial calcium uniporter 
in yeast. Proc. Natl Acad. Sci. USA 111, 8985-8990 (2014). 

Lee, S. K. et al. Structural insights into mitochondrial calcium uniporter 
regulation by divalent cations. Cell Chem. Biol. 23, 1157-1169 (2016). 

Csordas, G. et al. MICU1 controls both the threshold and cooperative activation 
of the mitochondrial Ca2* uniporter. Cell Metab. 17, 976-987 (2013). 
allilankaraman, K. et al. MCUR1 is an essential component of mitochondrial 
Ca?*+ uptake that regulates cellular metabolism. Nat. Cell Biol. 14, 1336-1343 
(2012). 

Kamer, K. J., Grabarek, Z. & Mootha, V. K. High-affinity cooperative Ca?* binding 
by MICU1-MICU2 serves as an on-off switch for the uniporter. EMBO Rep. 18, 
1397-1411 (2017). 

Perocchi, F. et al. MICU1 encodes a mitochondrial EF hand protein required for 
Ca?* uptake. Nature 467, 291-296 (2010). 

Liu, J. C. et al. MICU1 serves as a molecular gatekeeper to prevent in vivo 
mitochondrial calcium overload. Cell Reports 16, 1561-1573 (2016). 

Patron, M. et al. MICU1 and MICU2 finely tune the mitochondrial Ca?* 
uniporter by exerting opposite effects on MCU activity. Mol. Cell 53, 726-737 
(2014). 
Sancak, Y. et al. EMRE is an essential component of the mitochondrial calcium 
uniporter complex. Science 342, 1379-1382 (2013). 

Tsai, M. F. et al. Dual functions of a small regulatory subunit in the mitochondrial 
calcium uniporter complex. eLife 5, e15545 (2016). 

Raffaello, A. et al. The mitochondrial calcium uniporter is a multimer that can 
include a dominant-negative pore-forming subunit. EMBO J. 32, 2362-2376 
(2013). 

Chaudhuri, D., Artiga, D. J., Abiria, S.A. & Clapham, D. E. Mitochondrial calcium 
uniporter regulator 1 (MCUR1) regulates the calcium threshold for the 
mitochondrial permeability transition. Proc. Nat! Acad. Sci. USA 113, 
E1872-E1880 (2016). 

Tomar, D. et al. MCUR1 Is a scaffold factor for the MCU complex function and 
promotes mitochondrial bioenergetics. Cell Reports 15, 1673-1685 (2016). 
Bick, A. G., Calvo, S. E. & Mootha, V. K. Evolutionary diversity of the 
mitochondrial calcium uniporter. Science 336, 886 (2012). 

Mammucari, C., Gherardi, G. & Rizzuto, R. Structure, activity regulation, and role 
of the mitochondrial calcium uniporter in health and disease. Front. Oncol. 7, 
139 (2017). 

Oxenoid, K. et al. Architecture of the mitochondrial calcium uniporter. Nature 
533, 269-273 (2016). 

Song, J., Liu, X., Zhai, P., Huang, J. & Lu, L.A putative mitochondrial calcium 
uniporter in A. fumigatus contributes to mitochondrial Ca2+ homeostasis and 
stress responses. Fungal Genet. Biol. 94, 15-22 (2016). 

Shigetomi, E., Kracun, S., Sofroniew, M. V. & Khakh, B. S. A genetically targeted 
optical sensor to monitor calcium signals in astrocyte processes. Nat. Neurosci. 
13, 759-766 (2010). 


ARTICLE 


31. Hou, X., Pedi, L., Diver, M. M. & Long, S. B. Crystal structure of the calcium 
release-activated calcium channel Orai. Science 338, 1308-1313 (2012). 

32. Yang, W., Lee, H. W., Hellinga, H. & Yang, J. J. Structural analysis, identification, 
and design of calcium-binding sites in proteins. Proteins 47, 344-356 (2002). 

33. Saotome, K., Singh, A. K., Yelshanskaya, M. V. & Sobolevsky, A. |. Crystal structure 
of the epithelial calcium channel TRPV6. Nature 534, 506-511 (2016). 

34. Dong, Z. et al. Mitochondrial Ca@+ uniporter is a mitochondrial luminal redox 
sensor that augments MCU channel activity. Mol. Cell 65, 1014-1028.e7 
(2017). 

35. Lee, Y. et al. Structure and function of the N-terminal domain of the human 
mitochondrial calcium uniporter. EMBO Rep. 16, 1318-1333 (2015). 

36. Arduino, D. M. et al. Systematic identification of MCU modulators by orthogonal 
interspecies chemical screening. Mol. Cell 67, 711-723 (2017). 

37. Liao, M., Cao, E., Julius, D. & Cheng, Y. Structure of the TRPV1 ion channel 
determined by electron cryo-microscopy. Nature 504, 107-112 (2013). 

38. Doyle, D. A. et al. The structure of the potassium channel: molecular basis of Kt 
conduction and selectivity. Science 280, 69-77 (1998). 

39. Yan, Z. et al. Structure of the rabbit ryanodine receptor RyR1 at near-atomic 
resolution. Nature 517, 50-55 (2015). 

40. Sobolevsky, A. |., Rosconi, M. P. & Gouaux, E. X-ray structure, symmetry and 
mechanism of an AMPA-subtype glutamate receptor. Nature 462, 745-756 
(2009). 

41. Fan, G. et al. Gating machinery of InsP3R channels revealed by electron 
cryomicroscopy. Nature 527, 336-341 (2015). 

42. Zalk, R. et al. Structure of a mammalian ryanodine receptor. Nature 517, 44-49 
(2015). 

43. Efremov, R. G., Leitner, A., Aebersold, R. & Raunser, S. Architecture and 
conformational switch mechanism of the ryanodine receptor. Nature 517, 
39-43 (2015). ; 

44, Wu, J. et al. Structure of the voltage-gated calcium channel Cay1.1 at 3.6A 
resolution. Nature 537, 191-196 (2016). 

45. Tang, L. et al. Structural basis for Ca?* selectivity of a voltage-gated calcium 
channel. Nature 505, 56-61 (2014). 

46. Zhu, S. & Gouaux, E. Structure and symmetry inform gating principles of 
ionotropic glutamate receptors. Neuropharmacology 112, 11-15 (2017). 


Acknowledgements We thank the staff at Beamline 24ID-C/E, 231D-B/D, 

and 17ID (APS, Argonne National Laboratory), 12-2 (SSRL, SLAC National 
Laboratory), and 5.0.2 (Advanced Light Source). We thank Z. Li at HMS and C. 
Xu and K. Song at UMass cryo-EM facility for help with EM data collection. This 
work was made possible by support from Stanford University, the Klingenstein- 
Simons Fellowship, and the Harold and Leila Y. Mathers Charitable Foundation 
to LF, and from a National Science Foundation Graduate Research Fellowship 
to N.M.F. NE-CAT is supported by NIH P41 GM103403 and S10 RRO29205. We 
thank A. Manglik for discussions and R. Lewis for suggestions and comments. 


Reviewer information Nature thanks M. Prakriya, D. Stokes and the other 
anonymous reviewer(s) for their contribution to the peer review of this work. 


Author contributions C.F. and M.F. performed protein purification, biochemistry, 
crystallization, data collection, crystallography and functional assays. B.J.O. 
performed negative-stain EM, cryo-EM, and data processing. N.M.F. performed 
protein engineering, crystallization, and data collection. Y.X. performed protein 
purification and biochemistry. J.Z. performed crystallography. M.G.C. helped 
with initial EM screening. X.X. helped with initial characterization. K.P. assisted 
with crystallography. M.L. oversaw the EM studies and contributed to EM 

data collection and processing. L.F. conceived the project, oversaw the X-ray 
structural, biochemical, and functional studies and contributed to experimental 
work and crystallography. L.F. and M.L. wrote the manuscript with input from all 
co-authors. 


Competing interests The authors declare no competing interests. 


Additional information 

Extended data is available for this paper at https://doi.org/10.1038/s41586- 
018-0330-9. 

Supplementary information is available for this paper at https://doi. 
org/10.1038/s41586-018-0330-9. 

Reprints and permissions information is available at http://www.nature.com/ 
reprints. 

Correspondence and requests for materials should be addressed to M.L. or L-F. 
Publisher’s note: Springer Nature remains neutral with regard to jurisdictional 
claims in published maps and institutional affiliations. 


26 JULY 2018 | VOL 559 | NATURE | 579 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


ARTICLE 


METHODS 


Expression and purification. The coding sequence of Metarhizium acridum MCU 
without mitochondrial signal peptide (residues 62-484) was codon optimized and 
cloned into a modified pNGFP-BC plasmid with an N-terminal His-tag, GFPuv, 
and a 3C protease cleavage site. Based on limited proteolysis and secondary struc- 
ture prediction, the unstructured N- and C- termini and a disordered loop at the 
NTD were removed. A construct lacking residues 62-98, 190-205, and 427-484 
yielded weakly diffracting crystals. To promote better crystal growth, Clostridium 
pasteurianum rubredoxin was fused into a loop between S213 and G214 at the 
NTD, and two point mutations (H327W and M330F) were introduced. This con- 
struct, named MaMCU-Rub, was fully functional and produced improved crystals. 
To prepare MaMCU soluble domain, the N- terminal and C-terminal parts of 
the protein were directly fused together by deleting the transmembrane (TM) 
domain (A295-386). A disordered long loop in the NTD (residues 193-222) was 
further truncated. The final construct was named MaMCUsd and cloned into 
pNGFP-BC. Similarly, the coding sequence of Fusarium graminearum MCU was 
codon optimized and cloned into pNGFP-BC. The mitochondrial signal peptide 
and disordered N- and C termini were truncated (A1-120 and A448-480). Two 
disordered and non-conserved loops (residues 148-159 and 215-241) within the 
NTD were also removed. 

To prepare the MaMCU protein sample, the transformed E. coli BL21 (DE3) 
cells were grown at 37 °C until the OD¢oo reached 0.8-1.0 and the temperature was 
lowered to 20°C. Overexpression was induced by 0.2 mM isopropyl 8-D-thioga- 
lactoside and the cells were harvested 18h post induction. Cells were lysed by 
sonication in a buffer containing 50 mM Tris, pH 8.0, 150mM NaCl, 2mM CaCh, 
and protease inhibitor cocktail. Membrane proteins were extracted by incubating 
the lysate or the isolated membrane with 2% n-dodecyl-8-D-maltoside (DDM, 
Anatrace) at 4°C for 2h. After removal of the insoluble fractions, the supernatant 
was incubated with prewashed cobalt resin at 4°C for 2h. After washing the resin 
with five column volumes (CV) of buffer A containing 20 mM Tris, pH 8.0, 150 mM 
NaCl, 2mM CaCl, and 0.05% DDM supplemented with 10 mM imidazole and 
10 CV of buffer A supplemented with 20 mM imidazole, the resin was washed 
with buffer A and then incubated with 3C protease overnight to cleave the tag. 
MaMCU was further purified by gel filtration (Superdex 200 increase, GE health- 
care) in buffer containing 20 mM Tris, pH 8.0, 150mM NaCl, 2mM CaCh, and 
0.03% DDM. The peak fraction was collected and concentrated to 10-15 mg/ml 
for further use. 

MaMCU-Rub was purified similarly with a few modifications. The cell pel- 
lets were suspended in lysis buffer containing 50 mM Tris, pH 8.0, 150mM NaCl, 
10mM MgCh, and protease inhibitor cocktail. After sonication, membrane 
proteins were extracted by 2% n-decyl-8-D-maltoside (Anatrace) at 4°C for 2h. 
The insoluble fractions were removed by centrifugation and the supernatant was 
incubated with prewashed cobalt resin at 4°C for 2h. The protein was purified 
by affinity chromatography using cobalt resin, released from the resin by 3C pro- 
tease, and further purified by gel filtration (Superdex 200 increase, GE Healthcare) 
in buffer containing 10 mM HEPES, pH 7.5, 150mM NaCl, 10mM MgCh, and 
0.1% n-undecyl-6-D-maltoside (Anatrace). The peak fraction was collected and 
concentrated to 10 mg/ml for crystallization. Expression of selenomethionine- 
labelled MaMCU-Rub protein was achieved by following a published protocol*”. 
The protein was purified in the same manner as the unlabelled protein. To produce 
mercury-labelled MaMCU-Rub protein, cysteine point mutations were introduced 
by two-step PCR", and the protein was incubated with 2.5 mM thimerosal at room 
temperature for 2h before further purification by gel filtration. 

The soluble matrix domain of MaMCU, MaMCUsd, was expressed and puri- 
fied following the same protocol as described above, except that surface reductive 
methylation was performed after elution from the affinity column” and the gel 
filtration buffer did not contain any detergent. The peak fraction was collected and 
concentrated to 10 mg/ml for crystallization trials. 

FgMCU proteins were purified following the same protocol above for MaMCU. 
The final gel filtration buffer contained 20 mM Tris, pH 8.0, 150mM NaCl, 2mM 
CaCl, and 0.03% DDM. 

Nanobody selection and purification. Nanobodies were obtained from screen- 
ing immunized and synthetic libraries following published protocols*®*’. Phage 
display libraries that contain about 10° transformants were used to screen spe- 
cific binders to MCUs. After four rounds of selection, specific binders against 
MCUs were clearly enriched. The enriched clones were transferred into a yeast 
display system and further screened against MaMCU for three additional rounds 
following published protocols*”. Nanobodies that specifically recognize MaMCU 
were cloned into expression vector pMES4 with His tag. Five nanobodies were 
expressed in E. coli BL21 (DE3) and purified according to the standard procedure 
using cobalt resin. 

Crystallization. Crystallization trials were performed using the hanging-drop 
vapour-diffusion method at 20°C. MaMCU-Rub protein was mixed at a 1:0.8 
ratio with reservoir solution containing 100 mM HEPES, 7.5, 0.1M MgCh, and 


14% PEG 2000 MME. The crystals grew to full size in one to two weeks. Crystals 
were incubated in a cryoprotectant solution with the addition of 20% glycerol, 
and were flash-frozen in liquid nitrogen. MaMCU in complex with nanobody 
was prepared by mixing both components at a 1:1 ratio. The crystals were grown 
by vapour-diffusion in the presence of 15% PEG 2050, 200 mM (NH4)2SOu,, and 
100 mM Na-cacodylate, pH 6.0. The crystals grew to full size in about one week. 
The crystals were cryo-protected by the addition of 20% glycerol and flash frozen 
in liquid nitrogen. The MaMCUsd crystals were grown using the same method 
as described above. Plate-like crystals were grown in 0.1 M glycine, pH 9.3, 15.5% 
PEG 4000, and 0.6 M NaNO. The crystals were cryo-protected by adding 24% 
ethylene glycol into the crystallization buffer, and were then flash-frozen in liquid 
nitrogen. 

Data collection and structural determination. The diffraction data were collected 
at beamlines 24ID-C/E, 23ID-B/D and 17ID of the Advanced Photon Source, 5.0.2 
of the Advanced Light Source, and 12-2 of the Stanford Synchrotron Radiation 
Lightsource. Native data set for MaMCU-Rub, and MaMCuUsd was collected 
based on a single crystal and native data set for MaMCU was merged from three 
crystals. The diffraction data were processed using XDS** or HKL2000™*. The 
selenomethionine anomalous data were merged from multiple crystals. The data 
were integrated using XDS™ as implemented in RAPD (https://github.com/RAPD/ 
RAPD). The mercury anomalous data were collected on multiple crystals or 
multiple spots on the same crystals and were processed by XDS°? or HKL2000°4. 
The initial phase for MaMCUsd was obtained using a 10 A EM map asa search 
model using Phaser*, and the phase was gradually extended through NCS averag- 
ing to a 3.1 A resolution by RESOLVE through Phenix”*. The map was of sufficient 
quality to trace secondary structures. The initial model was built using AutoBuild 
in Phenix. The model was manually rebuilt in Coot” and refined by Phenix” in an 
iterative manner. The initial single wavelength anomalous dispersion (SAD) phas- 
ing for MaMCU-Rub (crystal form I) was obtained from either selenomethionine 
anomalous data or mercury anomalous data using SHELX C/D/E in HKL2Map™. 
In addition, an MR-SAD (combination of molecular replacement and single wave- 
length anomalous dispersion) protocol was carried out for the selenomethionine 
anomalous data and mercury anomalous data based on a molecular replacement 
(MR) solution with the soluble domain model (MaMCUsd) as the search model. 
SAD and MR-SAD maps allowed tracing and registering the soluble domain with 
the aid of the MaMCUsd model. For the TM domain, initially ideal poly-alanine 
helices were manually placed into the SAD and MR-SAD maps, guided by the EM 
map. To assist assignment of the helix register, we made single cysteine mutants 
for each residue on the TM helices. The accessibility of cysteine for the mercury 
labelling was assessed by 5,5’-Dithiobis(2-nitrobenzoic acid) on the purified pro- 
tein. For each mutant, the MR solution was obtained by Phaser*>. The Hg sites were 
identified by Phaser EP in Phenix” and confirmed by anomalous difference maps. 
The register of the TM helices were aided by 12 Hg markers on two TM helices, one 
Hg marker on the linker connecting TM1 and TM2, as well as a selenomethione 
marker on TM2. For MaMCU in complex with the nanobody (crystal form II), 
the combined and scaled data from 3 crystals was anisotropically truncated using 
UCLA Diffraction Anisotropy Server. The data (3.9 A x 3.1 A x 3.3 A along a*, b*, 
and c* axes, respectively) was used for refinement. The molecular replacement 
solution was found using MaMCU-Rub as a search model by Phaser**. The model 
was iteratively refined and rebuilt in Phenix™®, Buster*? and Coot*”. This final 
model was used for all structural interpretation on MaMCU. Structural quality was 
reported by MolProbity® and structural renderings were prepared using PYMOL 
(Schrédinger, LLC). lon-conducting pore radius was calculated by HOLE”. 
Cryo-EM sample preparation. To reconstitute the MCU in amphipol, puri- 
fied proteins after gel filtration were mixed with amphipol A8-35 (Anatrace) or 
PMAL-C8 (Anatrace) at a mass ratio of 1:3 for 3h on ice. To prepare the MCU in 
nanodiscs, purified proteins were mixed with an equal molar ratio of the scaffold 
protein MSP1D1 in the presence of an 8 mM lipid mixture that contained 3:1 
2-oleoyl-1-palmitoyl-sn-glycero-3-phosphocholine (POPC, Avanti) and cardi- 
olipin (Avanti) for 2h on ice. The protein samples with amphipol and nanodiscs 
were incubated with Bio-beads at 4°C overnight to remove detergents and were 
further purified by gel filtration in a buffer containing 150 mM NaCl, 10mM Tris, 
pH 8.0, and 2mM CaCl. The peak fraction was collected for EM analysis. 
Cell-based uptake assay. The MCU and a membrane-tethering calcium sensor 
Lck-GCaMP2** were co-expressed using a pETDuetl vector. The transformed 
E. coli strain BL21(DE3) was induced as described above. A 3 ml cell culture was 
centrifuged and washed twice with 3 ml UAB buffer (100 mM KCl, 50 mM Tris, pH 
7.5, 1mM MgCl), and then diluted to an OD600 of 1.0 for an uptake assay. A large 
negative membrane potential across the E. coli plasma membrane® drives Ca?* 
uptake. The assays were carried out using Greiner black polystyrene flat-bottom 
96-well plates (Sigma-Aldrich). For each assay reaction, 501M or 0.5mM calcium 
was added into 10011 cells and immediately mixed to initiate the uptake. For ruthe- 
nium red inhibition experiments, 0.5-2 1M ruthenium red (Sigma-Aldrich) was 
mixed with cells before uptake assays. All experiments were performed at room 
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temperature and repeated at least four times. Data were recorded on an infinite 
M1000Pro plate reader and processed using Tecan i-contro software. Fluorescent 
signals were measured every 5-10s with the excitation set at 485 nm and emission 
at 510nm. Ca”* uptake activity is quantified as a linear fit of the fluorescence sig- 
nal during the first 20s after the addition of calcium. To evaluate the expression 
level of the MCU mutants, MCU mutant constructs were cloned into pNGFP-BC. 
Proteins were expressed in 3 ml LB culture under the identical condition as for the 
functional assay. Solubilized proteins were analysed by SDS-PAGE and MCU- 
GFP fusion protein was visualized by in gel fluorescence using a Bio-Rad imaging 
system with excitation at 302 nm and emission at 440 nm. 

Liposome-based flux assay. Liposome was reconstituted according to a published 
protocol®, Purified MCU proteins were mixed with E. coli polar lipids (Avanti) at a 
1:80 ratio in the presence of reconstitution buffer (150 mM NaCl, 10 mM HEPES, 
pH 7.5, 5mM EDTA) plus 0.12% (w/v) Triton-X100. Detergents were removed by 
overnight incubation with Bio-beads. After extrusion through a 0.4|1m membrane, 
proteoliposome was pelleted by ultracentrifugation at 100,000 g for 1h. The pro- 
teoliposome was washed twice with ice-cold FAB buffer that contained 150 mM 
NMDG, 10mM HEPES, pH 7.5, 5mM EDTA and 21.M ACMA, and then resus- 
pended in the same buffer to a final lipid concentration of 100 mg/ml. The flux 
assay was performed following published protocols?) in a 96-well format. A 
total of 211 of proteoliposomes were diluted into 100,11 of FAB. After 150s, 114M 
carbonyl cyanide m-chlorophenyl hydrazone (CCCP, Sigma-Aldrich) was added 
to initiate Na* flux. A total of 20nM sodium ionophore monensin (Sigma-Aldrich) 
was added near the end of the experiments to release Nat from the liposomes. For 
Gd?+ blockage experiments, 0.3 mM GdCl; was added to the FAB buffer before flux 
assays. Data collected on an infinite M1000Pro plate reader were processed using 
Tecan i-contro software. Fluorescent signals were measured every 15s (excitation: 
410nm and emission: 490 nm). 

Cysteine crosslinking. Double cysteine substitutions were introduced in a 
cysteine-free version of MaMCU by two-step PCR“*. The mutants were expressed 
in the same manner as the WT protein. For crosslinking in detergent solution, 
1 mg/ml purified protein in DDM buffer (20 mM Tris, pH 8.0, 150 mM NaCl, 
2mM CaCl, 0.03% DDM) was incubated with 0.1 mM of Cu(II) (1,10-phenant- 
roline)3 at 4°C. For crosslinking in membrane, the membrane fraction containing 
overexpressed His-tagged protein was prepared and incubated with 0.3 mM of 
Cu(ID (1,10-phenantroline); at 4°C. The reactions were stopped at different time 
points by the addition of EDTA to a final concentration of 50 mM, and the reaction 
products were analysed by SDS-PAGE. For western blot analysis, the proteins were 
transferred from a gel to a PVDF membrane and probed with mouse anti-penta- 
His antibody (Qiagen), followed by detection with donkey anti-mouse IR680 sec- 
ondary antibody (LI-COR Biosciences). Blots were visualized and analysed using 
a LI-COR Odyssey system. Double cysteine substitutions of mosquito MCU was 
expressed and purified in DDM with 0.01% cholesteryl hemisuccinate similarly to 
MaMCU. The crosslinking was performed in DDM buffer with 0.01% cholesteryl 
hemisuccinate in a similar manner to MaMCU. 

Electron microscopy data acquisition. Negatively stained specimens were pre- 
pared following an established protocol with minor modifications’. Specifically, 
2.5 ul of purified MCU was applied to glow-discharged copper EM grids covered 
with a thin layer of continuous carbon film, and the grids were stained with 2.5% 
(w/v) uranyl formate. These grids were imaged on a Tecnai T12 electron micro- 
scope (FEI) operated at 120kV at a nominal magnification of 67,000x using a 
4k x 4k CCD camera (UltraScan 4000, Gatan), corresponding to a calibrated pixel 
size of 1.68 A on the specimen level. For cryo-EM of FEMCU in PMAL-C8 and 
nanodiscs, 4 11 of a5 mg/mL sample containing 0.3 mM fluorinated Fos-Choline-8 
(Anatrace) was applied to a glow-discharged Quantifoil holey carbon grid (1.2/1.3, 
400 mesh), and blotted for 3 s with ~92% humidity before plunge-freezing in liquid 
ethane using a Cryoplunge 3 System (CP3, Gatan). For cryo-EM of MaMCU in 
A8-35, a 2.5 11 sample at a concentration of 2mg/ml applied to a grid was similarly 
blotted and plunge-frozen using a CP3. The cryo-EM data were recorded on a 
300kV Polara electron microscope (FEI) at Harvard Medical School, and a 200kV 
Talos Arctica and a 300kV Titan Krios electron microscope (FEI) at the University 
of Massachusetts Medical School. All cryo-EM movies were recorded with a K2 
Summit direct electron detector (Gatan) in super-resolution counting mode using 
UCSFImage4® or SerialEM™. The details of the EM data collection parameters 
are listed in Extended Data Table 1. 

EM image processing. The EM data were processed as previously described”’. 
Negative-stain EM images were binned over 2 x 2 pixels for further processing, 
yielding a pixel size of 3.36 A. Dose-fractionated super-resolution movies collected 
using the K2 Summit direct electron detector were binned over 2 x 2 pixels, and 
then subjected to motion correction using the program MotionCor2”!. A sum of 
all frames of each image stack was calculated following a dose-weighting scheme, 
and used for all image-processing steps except for defocus determination. The 
program CTFFIND4” was used to calculate defocus values of the summed images 
from all movie frames without dose weighting. Particle picking was performed 
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using a semi-automated procedure with SAMUEL and SamViewer’®. 2D classifi- 
cation of selected particle images was carried out by SPIDER (samclasscas.py) or by 
RELION’*”®, Initial 3D models were generated with 2D class averages by SPIDER 
3D projection matching refinement (samrefine.py), starting from a cylindrical 
density mimicking the size and shape of the MCU. 3D classification and refine- 
ment were carried out using “relion_refine_mpi” in RELION. The masked 3D 
classification focusing on the TM domain of FgMCU in nanodiscs with residual 
signal subtraction was performed following a previously described procedure”®. 
The first round of 3D classification of all cryo-EM data sets was carried out with- 
out symmetry, which was followed by further 3D classification and refinement 
with two-fold symmetry. All refinements followed the gold-standard procedure, 
in which two half data sets were refined independently. The overall resolutions 
were estimated based on the gold-standard criterion of Fourier shell correlation 
(FSC) =0.143. Local resolution variations were estimated from two half data maps 
using ResMap”’. The amplitude information of the final maps were corrected by 
applying a negative B-factor using the program bfactor.exe’®. To compare the crys- 
tal structure with the cryo-EM maps, the atomic model was fitted into the cryo-EM 
maps as a rigid body using the function of “Fit in Map” in UCSF Chimera”. 
Reporting summary. Further information on experimental design is available in 
the Nature Research Reporting Summary linked to this paper. 

Data availability. The crystallographic coordinates and structural factors are 
deposited into the Protein Data Bank with accession numbers 6C5W and 6C5R. 
The 3D cryo-EM density maps of MaMCU in A8-35, FEMCU in PMAL-C8 and 
FgMCU in nanodiscs (overall, type 1 and type 2 conformations) have been depos- 
ited into the Electron Microscopy Data Bank under accession numbers EMDB- 
7800, EMDB-7801, EMDB-7802, EMDB-7803 and EMDB-7804. 
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Extended Data Fig. 1 | Sequence alignment, sample preparations and 
functional characterizations. a, Sequence alignment of MCU. The protein 
sequences of MCUs from Fusarium graminearum (FgMCU), Metarhizium 
acridum (MaMCU), Homo sapiens (HsMCU) and Caenorhabditis elegans 
(CeMCVU) were aligned using Clustal Omega with manual adjustment. The 
domain organization is shown above the sequences. b, FEMCU in DDM 
on size-exclusion chromatography. c, MaMCU in DDM on size-exclusion 
chromatography. Data in b, c are representative of five independent 


Time (sec) 


experiments with similar results. d, Ca”* uptake activity of FEMCU in 
E. coli. Fluorescence changes upon addition of 0.5 mM Ca** are shown 
for wild-type (red), D351A/E354A (green), control (black), and with 
ruthenium red (blue). e, Cation flux activity of FEMCU in reconstituted 
liposomes. Fluorescence changes are shown for FgMCU (red) and 

with Gd?* (blue). Data in d, e are representative of three independent 
experiments with similar results. 
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Extended Data Fig. 2 | See next page for caption. 
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Extended Data Fig. 2 | Crystal packing and molecule interactions in 
the crystal. a, The crystal lattice structure of MaMCU-Rub. One MaMCU 
channel is coloured blue and its associated rubredoxin is coloured cyan. 
In the crystal, the transmembrane domain lacks clear protein-mediated 
contacts. Presumably, relatively disordered protein regions or detergent 
micelles mediate the contact. b, The crystal lattice structure of MaMCU 
in complex with nanobody. One MCU is shown in blue and the bound 
nanobodies are shown in cyan. c, The crystal lattice structure of the 
MaMCU soluble domain, shown in ribbons. Each chain is coloured 
differently. Two helices that formed a coiled-coil were derived from two 
different chains, in a domain-swapped manner. d, Close-up view of the 


ARTICLE 


interaction between nanobody and McMCU in the crystal. Nanobody 
binds near the end of TM] and is distant from the selectivity filter 

(shown as coloured sticks). Nanobody is coloured grey and MaMCU is 
colored cyan. e, Mercury and selenium sites identified by MR-SAD. Two 
protomers of MaMCU in one asymmetric unit are shown as ribbons in 
green and blue, respectively. The mercury sites identified from ‘marker 
mutants’ are shown as red spheres; one selenium site from the labelled 
wild-type protein is shown in the same manner. f, 2F,—F, electron density 
map of MaMCU (cyan mesh, 1) with the structure model shown in 
ribbon. g, Close-up view of 2F,—F, electron density map of individual 
transmembrane helices (cyan mesh, 1.5c) with side chains shown in sticks. 
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MaMCU - crystallography 


Extended Data Fig. 3 | Comparison of the NMR structure of cMCU 
with the crystal structure of MaMCU, and the structure of the soluble 
domain of MaMCU. a, Left, pentameric NMR structure of cMCU. Right, 
dimer-of-dimers structure of MaMCU. Box outlines the equivalent region 
including the transmembrane domain and coiled-coil. b, Comparison 

of a subunit of CMCU by NMR with that of MaMCU by crystallography. 
The helical part of the pore-lining TM2 is superimposed. The rest of the 
protein shows markedly different structure. Right, expanded view of the 
selectivity filter region. The most conserved WDxxEP residues (shown as 


i NTD of MaMCU 
©} NTD of human MCU 


HcMCU - NMR 
@ MaMCU - crystallography 


i WA pair of MaMCU NTD 
© @ Apair of human MCU NTD 


sticks) are organized differently in these two structures. c, Ribbon 
representation of the soluble domain of MaMCU. Each subunit is coloured 
in blue, green, gold, or cyan, with the portions before and after the 
transmembrane domain of each subunit in slightly different colours. 

d, Structural comparison of the NTDs of MaMCU (purple) and human 
MCU (cyan). The two structures have the same fold and superimpose 

well onto each other. e, A pair of human MCU NTDs (cyan and blue) 
superimposes well onto one of dimers formed by MaMCU NTDs (purple 
and pink). 
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Extended Data Fig. 4 | Negative-stain EM and cryo-EM of MaMCU and 
FgMCU. a, Representative negative-stain EM image and 2D averages of 
MaMCU in A8-35 (from 28 micrographs with similar results), MaMCU 
in nanodiscs (from 28 micrographs with similar results), FEMCU in 
PMAL-C8 (from 20 micrographs with similar results), and FEMCU in 
nanodiscs (from 40 micrographs with similar results). The box dimension 
of the 2D averages is 215 A. b, Representative cryo-EM image (from 1,862 
micrographs with similar results), 2D averages, and image processing 


workflow of MaMCU in A8-35. The box dimension of the 2D averages 

is 172 A. c, Representative cryo-EM image (from 4,055 micrographs with 
similar results), 2D averages, and image processing workflow of FgMCU in 
PMAL-C8. The box dimension of the 2D averages is 176 A. d, Different 
views of the cryo-EM maps of MaMCU in A8-35 and FgMCU in 
PMAL-C8, both low-pass filtered to 7 A. 2D averages and map calculation 
were not repeated. 
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Extended Data Fig. 5 | Single-particle cryo-EM analysis of FEMCU in 
nanodiscs. a, Representative cryo-EM image (from 5,332 micrographs 
with similar results). b, 2D averages of cryo-EM particle images. The 
box dimension is 187 A. c, Image processing workflow. Two cryo-EM 
data sets were first independently subjected to 3D classification without 
symmetry, and the particle images in good classes were combined and 
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3D refinement (C2) 
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further processed with two-fold symmetry, generating a 4.9 A-resolution 
cryo-EM map. Subsequently, a final round of 3D classification focused on 
the transmembrane domain sorted out two major classes, which generated 
two final maps at 5.0 and 4.8 A resolution for the transmembrane domain. 
The conformations in these two maps are designated as type 1 and type 2. 
2D averages and map calculations were not repeated. 
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Extended Data Fig. 6 | Cryo-EM maps of two conformations of FEMCU 
in nanodiscs. a, Gold-standard FSC curves of the type 1 final cryo-EM 
map, calculated with a soft mask to include only the transmembrane 
domain (TMD, solid line) or the entire MCU molecule without nanodisc 
(dashed line). The estimated resolutions based on an FSC = 0.143 criterion 
are indicated. b, Cross-sectional view of the type 1 final cryo-EM map 
filtered to 5.0 A resolution and coloured according to local resolution. 
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c, Angular distribution of the particle images used for calculating the type 
1 final cryo-EM map. d-f, Same as a-c, but for the type 2 conformation. 
g, Two cross-sectional views of the transmembrane domains of the type 

1 and type 2 cryo-EM maps, superimposed onto an MaMCU-based 
homology model of FEMCU. The missing density and extra densities 

in the type 2 map are indicated. Map calculations in a and d were not 
repeated. 
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Extended Data Fig. 7 | Tetrameric assembly of MaMCU, overall analysed by size-exclusion chromatography. The peak fractions were 
architecture of a metazoan MCU and symmetry mismatch of MaMCU. subjected to SDS-PAGE and visualized by Coomassie blue staining. e, Site- 
a, The design of cysteine pair mutations for crosslinking transmembrane directed disulfide cross-linking of mosquito MCU in detergent solution. 
subunits. L314 on TM1 and M348 on TM2 were mutated to cysteines A pair of residues (T294 on TM1 and V341 on TM2) that were equivalent 
(side chains shown as sticks). b, Site-directed disulfide cross-linking of to those used in MaMCU crosslinking at the protomer interface were 
MaMCU in detergent solution. L314 on TM1 and M348 on TM2 were mutated to cysteine. The purified double cysteine mutant was oxidized 
mutated to create a cysteine pair at the protomer interface on an otherwise by CuP and analysed by SDS-PAGE. The tetramer position on the 
cysteine-free background. The double cysteine mutant was purified and SDS-PAGE gel is the same as that of MaMCU in b. f, 2D averages of 


oxidized by copper phenanthroline (CuP) for various time durations. The negatively stained FEMCU in nanodiscs, mosquito MCU (T294C/V341C) 
cross-link product was analysed by SDS-PAGE. c, Site-directed disulfide in nanodiscs, and mosquito MCU (T294C/V341C) in amphipol A8-35. 


cross-linking of MaMCU in the membrane. Isolated membrane from the 2D averages were not repeated. g, Geometry of markers in transmembrane 
MaMCU L314C/M348C mutant was treated with CuP for an increasing domain and NTD of MaMCU. Site-specific mercury marker on 
amount of time. The cross-linking product was analysed by western blot transmembrane domain (red spheres) and Ca position of Ile115 in NTD 


using an antibody against the His tag on the protein. d, The size-exclusion _ (blue spheres) are indicated. Data in b-e are representative of three 
profile of MaMCU with or without crosslinking. MaMCU L314C/M348C independent experiments with similar results. For gel source data, see 
mutant treated or untreated with CuP in the membrane was purified and Supplementary Fig. 1. 
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Extended Data Fig. 8 | Expression levels of MaMCU mutants and 
ruthenium red sensitivity of selected mutants. a, The expression level of 
wild-type MaMCU and those with a point mutation in the transmembrane 
domain. Green fluorescent protein-tagged protein was analysed on 
SDS-PAGE and the protein was visualized by in-gel fluorescence. The 
experiments were repeated twice independently with similar results. For 
gel source data, see Supplementary Fig. 1. b-m, The effect of substituting 


residues near the intermembrane surface of MaMCU on its ruthenium 
red sensitivity. Ca** uptake activity of the wild-type or mutant MaMCU 
was assayed in an E. coli-based assay (0.5 mM Ca**, 1M RuR). The Ca?* 
uptake in the absence (blue) or in the presence (black) of ruthenium red 
is reflected by fluorescence changes. The experiments were repeated three 
times independently with similar results. 
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Extended Data Table 1 | Statistics of X-ray data collection and refinement 


Data collection 
Space group 
Cell dimensions 
a, b, c (A) 
a, B, y (°) 
Resolution (A)* 


Wavelength (A) 
Rmerge (%)* 

// of 
Completeness (%)* 
Redundancy* 


Refinement 
Resolution (A) 
Unique reflections 


Rwork / Ritee (%) 
No. atoms 
Protein 
Ligand/ion 
B-factors 
Protein 
Ligand/ion 
R.m.s deviations 
Bond lengths (A) 
Bond angles (°) 


«High-resolution shell is shown in parentheses. 
#Reflections after anisotropic correction (3.9, 3.1, and 3.3A, respectively along a*, b*, and c* reciprocal cell directions) were used in the refinement. 


MaMCU 
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88.0, 290.4, 126.1 
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43.88 -3.05 
(3.16-3.05)" 
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99.9 (99.9) 
13.1 (13.0) 


45.0 - 3.10 
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25.5/30.1 


5746 
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75.71 
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0.54 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


MaMCU 
soluble domain 


C2 


261.8, 119.9, 88.0 
90.00, 106.88, 90.00 
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100.0 (100.0) 
3.4 (3.6) 
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Extended Data Table 2 | Cryo-EM data collection statistics 


Data collection and 

processing 

Microscope 

Magnification 

Voltage (kV) 

Electron exposure (e-/A’) 

Defocus range (um) 

Pixel size (A) 

Symmetry imposed 
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Initial particle images (no.) 

Final particle images (no.) 

Map resolution (A) (TMD) 
FSC threshold 
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Cryo-EM structures of fungal and 
metazoan mitochondrial calctum 


uniporters 


Rozbeh Baradaran!?, Chongyuan Wang?, Andrew Francis Siliciano! & Stephen Barstow Long!* 


The mitochondrial calcium uniporter (MCU) is a highly selective calcium channel and a major route of calcium entry into 
mitochondria. How the channel catalyses ion permeation and achieves ion selectivity are not well understood, partly 
because MCU is thought to have a distinct architecture in comparison to other cellular channels. Here we report cryo-electron 
microscopy reconstructions of MCU channels from zebrafish and Cyphellophora europaea at 8.5 A and 3.2 A resolutions, 
respectively. In contrast to a previous report of pentameric stoichiometry for MCU, both channels are tetramers. The atomic 
model of C. europaea MCU shows that a conserved WDXXEP signature sequence forms the selectivity filter, in which calcium 
ions are arranged in single file. Coiled-coil legs connect the pore to N-terminal domains in the mitochondrial matrix. In 
C. europaea MCU, the N-terminal domains assemble as a dimer of dimers; in zebrafish MCU, they form an asymmetric 
crescent. The structures define principles that underlie ion permeation and calcium selectivity in this unusual channel. 


The Ca?*+ channel of the inner mitochondrial membrane, known as the 
mitochondrial calcium uniporter (MCU), enables mitochondria to take 
up substantial amounts of Ca”* from the cytosol during intracellular 
Ca?* signalling events'~°. Ca’* influx stimulates mitochondrial ATP 
synthesis, controls the mitochondrial permeability transition that leads 
to cell death, and acts in a buffering role to shape cytosolic calcium 
signals®. A negative electric potential (approximately —160 mV) of the 
mitochondrial matrix relative to the cytosol, which is established by 
the proton gradient across the inner membrane, is the driving force 
for Ca? influx’. MCU is one of the most Ca**-selective channels 
known and must be highly discriminatory to enable the uptake of 
Ca?* while excluding other cytosolic cations, of which potassium is 
more than 100,000 times more abundant (Ca? Tevigeast ~0.1-1 uM and 
[K*] cytosol * 150 mM). The selectivity of MCU is of paramount impor- 
tance because substantial cation leakage could collapse the electro- 
motive force used for ATP synthesis. It is thought that selectivity is 
achieved, in part, by high-affinity binding of Ca (dissociation constant 
(Ka) < 2 nM) within a ‘selectivity filter’ region of the pore®. The selec- 
tivity filter might involve a conserved signature sequence (WDXXEP) 
near the channel's two predicted transmembrane helices**, but its 3D 
conformation is not known, partly because MCU shares no apparent 
sequence similarity to other ion channels. Ca** ions can move rapidly 
through the pore of MCU (maximally at about 5 x 10° Ca?* s~')8, 
and this raises the apparent paradox of how both ion selectivity and 
high conductance are achieved. 

MCU channels are present in most clades of eukaryotic life, including 
protists, plantae, amoebozoa, fungi, and metazoa’. The MCU protein 
subunit oligomerizes to form the Ca?*-conducting pore*, but channel 
activity is regulated differently among clades. Metazoans require the 
single-pass transmembrane protein EMRE for Ca?* uptake, whereas 
the genomes of fungi, amoeba, and plants do not have a recognizable 
EMRE orthologue'®. Many MCU channels are further regulated by 
MICU1 and MICU2 proteins, which are present in the intermembrane 
space!!?, and by the protein MCURI!*-". The structures of MCU 
from zebrafish and the fungus C. europaea presented here define the 


architecture of the channel and give insight into the mechanisms of ion 
permeation and Ca’* selectivity. 


Structure determination and oligomeric state 

C. europaea MCU (CeMCVU) and zebrafish MCU have 44% and 91% 
sequence similarity to human MCU, respectively, and were selected 
for structural determination on the basis of good biochemical 
stability in detergent-containing solutions (see Methods, Extended 
Data Fig. 1). Zebrafish MCU catalyses mitochondrial Ca** uptake 
when co-expressed with EMRE (Extended Data Fig. 2a-c). Uptake is 
blocked by ruthenium red and is dependent upon the proton gradient. 
Thus, zebrafish MCU recapitulates known properties of the human 
channel*!®, 

An orthologue of EMRE has not been identified in the genome 
of C. europaea. MCU channels from Dictyostelium discoideum and 
Arabidopsis thaliana, two other species without an EMRE, can mediate 
mitochondrial Ca?* uptake when heterologously expressed!®!”. 
However, we did not observe uptake when CeMCU or several other 
fungal MCU channels were expressed in yeast or mammalian cells, and 
conjecture that at least a subset of fungal channels may be regulated 
differently from other MCU channels. 

For cryo-electron microscopy (cryo-EM) sample preparation, full 
length CeMCU and zebrafish MCU were produced using eukaryotic 
expression systems and purified using gentle, non-denaturing, methods. 
In a previous NMR study of a truncated version of Caenorhabditis 
elegans MCU lacking an N-terminal domain (NTD), which resulted in 
a different structure from the one presented here'’, a prokaryotic sys- 
tem was used for expression and purification involved a relatively harsh 
detergent (foscholine-14), which is rarely used in structural studies of 
membrane proteins””. 

The small molecular weight of MCU, just approximately 110 kDa 
for the ordered region of the channel assembly, poses challenges for 
structure determination by cryo-EM. We overcame these by collecting 
micrographs at high magnification and high total electron dose 
(see Methods). De novo initial 3D models of CEMCU and zebrafish 
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Four-fold 


Fig. 1 | Cryo-EM map and overall structure of CeMCU. a, Slices of a 
3D reconstruction generated without imposing symmetry (Extended 
Data Fig. 3). b, Final 3D reconstruction viewed parallel to the membrane 
(80 contour level). Density for the detergent micelle is present at lower 
contours (for example, Extended Data Fig. 3). Subunits are coloured 
uniquely. c, Ribbon representation in the same orientation as b, with 
approximate membrane boundaries as grey bars. IMS, intermembrane 
space. Disordered regions connecting TM2 and a6 are indicated as 
dashed lines. 


MCU were generated independently and refined without imposing 
symmetry in order to avoid bias with regard to oligomeric state or 
overall structure (Extended Data Figs. 3, 4). The 3D reconstructions 
demonstrate that both channels assemble as tetramers of MCU subunits 
(Figs. 1, 2). 3D classification of the particle images, which would reveal 
global differences such as differences in oligomeric state, consistently 
yielded reconstructions that were tetrameric (Extended Data Figs. 3, 4). 
A tetrameric oligomeric state is supported by crosslinking experiments 
of purified protein and of human MCU within mitochondrial 
membranes (Extended Data Fig. 2h-j). The 3D reconstructions of 
CeMCU and zebrafish MCU were determined at overall resolutions 
of 3.2 Aand 8.5 A, respectively (Figs. 1, 2 and Extended Data Figs. 5, 6). 
Density for most amino acid side chains of CeMCU allowed us to 
construct an atomic model with good stereochemistry and good 
correlation with the cryo-EM density (Fig. 1, Extended Data Fig. 6 
and Extended Data Table 1). Because of the higher resolution, the dis- 
cussion of atomic details that follows derives from CeMCU. Although 
individual a-helices cannot be resolved in the map of zebrafish 
MCU, aside from differently organized NTDs, the overall structures of 
zebrafish MCU and CeMCU are similar (Figs. 1, 2). 


Overall architecture 

MCU has a novel architecture for an ion channel (Fig. 1, Extended 
Data Fig. 7). The molecular surface contains a hydrophobic transmem- 
brane domain (TMD) that would reside within the inner mitochondrial 
membrane in a cellular setting and a large hydrophilic region in the 
matrix (Extended Data Fig. 7d). The TMD has four-fold symmetry 
(Figs. 1, 2) with eight transmembrane helices (the two transmembrane 
helices (TM1 and TM2) from each of the subunits (Fig. 1)). The trans- 
membrane helices interact through mostly hydrophobic amino acids 
(Extended Data Fig. 7g). Short ‘turrets’ connecting these helices are 
the only protrusions into the intermembrane space. On the matrix 
side, four ‘legs’ extend from the TMD. The legs are composed of an 
N-terminal extension of the TM] helix (TM1,¢) and a C-terminal helix 
(a6) that forms a coiled-coil with it in the matrix (Fig. 1c, Extended 
Data Fig. 7a). Sequence analysis indicates that all MCU channels 
contain analogous coiled-coil regions’. The legs are separated from 
one another, creating ample space for ions to diffuse between them 
in the matrix. An NTD, which each MCU orthologue is predicted to 
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Fig. 2 | Cryo-EM reconstruction of zebrafish MCU. a, 2D slices of the 

3D reconstruction of zebrafish MCU are shown at the indicated positions. 
b, Two orthogonal views of the 3D reconstruction viewed parallel to 

the membrane. Grey bars indicate approximate membrane boundaries. 
Density attributed to the detergent micelle surrounding the TMD is 
indicated. c, Matrix view, showing density for four NTDs. The TMD region 
is removed for clarity. d, Atomic models (PDB: 4XTB) of four NTDs are fit 
into the density from c (mesh). 


contain”, is located at the end of each of the legs, approximately 70 A 
from the membrane. In CeMCU, the four NTDs are arranged as a 
dimer of dimers, and this yields an overall two-fold (C2) symmetry for 
the channel (Fig. 1). In zebrafish MCU, the four NTDs are arranged 
as an asymmetric crescent (Fig. 2). Thus, in both channels, there is 
a symmetry transition between the four-fold symmetric TMD and 
the matrix region. The cryo-EM structures differ substantially from 
the previous structural model of MCU that was deduced from NMR 
experiments'® both on the subunit level and in terms of its assembly 
(Extended Data Fig. 8). 


Ion pore 
MCU contains a single ion pore in the TMD that is located along the 
symmetry axis and would be perpendicular to the membrane in a cellular 
setting (Figs. 1, 3). The secondary structure of the channel is entirely 
a-helical within the TMD; amino acid side chains emanating from 
a-helices form the walls of the pore. The pore can be divided into two 
portions: a narrower portion that occupies the outer leaflet of the mem- 
brane and contains the selectivity filter and a wider ‘vestibule’ portion 
located in the inner leaflet (Fig. 3). In the outer leaflet, TM2 lines the 
pore and TM1 is located at the periphery, whereas both TM1 and TM2 
line the vestibule. The WDXXEP signature sequence (WDVMEP in 
CeMCU) adopts an a-helical secondary structure within TM2, forming 
approximately three turns of this helix with the W residue (Trp224) at 
its N-terminal end. The side chains of the D and E residues (Asp225 
and Glu228) protrude into the pore. The W and P residues (Trp224 and 
Pro229) are tightly packed and generally oriented towards the pore, 
although they are not expected to interact with Ca”*. The X residues 
(Val226 and Met227), face away from the pore and would be exposed 
to the membrane in a cellular setting (Fig. 4b). Pro229 induces a slight 
bend in the TM2 helix owing to local disruption of the a-helical hydro- 
gen bonding (Fig. 4b). 

To assess the relevance of the conformation of the WDXXEP 
sequence observed in CeMCU to human MCU, we introduced 
alanine and other mutations at each position of the signature sequence 
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Fig. 3 | Ion pore. Within a ribbon representation of two subunits of 
CeMCU is a representation (grey surface) of the minimal radial distance 
from the centre of the pore to the nearest van der Waals protein contact. 
Selected amino acids are shown as sticks. 


of human MCU (WDIMEP) and evaluated mitochondrial Ca?* uptake 
of the mutants by co-transfecting with wild-type human EMRE in 
mammalian cells in which the endogenous MCU and EMRE genes 
had been disrupted (MCU/EMRE knockout cells). Consistent with a 
pattern predicted by the structure, and with previous mutagenesis of 
the acidic amino acids*, Ca”* uptake was not observed when pore- 
oriented amino acids (W, D, E, or P) were mutated, but was observed 
for mutants of either X residue, which face the membrane in the 
structure of CeMCU (Fig. 4d and Extended Data Fig. 2d-g). The first 
X residue, Ile262 of human MCU, was particularly insensitive to muta- 
tion. The corresponding side chain of CEMCU (Val226) has no protein 
contacts in the structure. Both the 1262V mutant, in which the signa- 
ture sequence is converted to that of CeMCU, and the 1262A mutant 
had full activity. M263A showed diminished but significant activity 
relative to the wild type, and this is consistent with the observation 
that while Met227 of CeMCU faces the membrane it also interacts with 
TM1 (Fig. 4b). These data support the conclusion that the signature 
sequence adopts similar conformations in metazoan and fungal MCU 
channels. 


Selectivity filter 

The selectivity filter is formed by symmetrical arrangement of WDXXEP 
signature sequences around the pore (Fig. 4). This region is both the 
narrowest portion of the pore and the only one that is lined by acidic 
amino acids. Its -helical secondary structure is distinct from channels 
belonging to the superfamilies of CaV, NaV, Kv, TRP, RyR, and IP3R 
cation channels, which possess loop-like secondary structures within 
their selectivity filters*°-?”. The D (Asp225) residues of the tetrameric 
channel form an acidic mouth on the side of the pore facing the inter- 
membrane space, giving the pore a radius of approximately 2.2 A in 
this region (Fig. 3). The E (Glu228) residues form a second acidic ring, 
with a pore radius of less than 1 A (Fig. 3). These glutamate residues 
extend fully away from the helical axis of TM2 and point towards the 
centre of the pore (Fig. 4c). The substitution of an aspartate residue 
would markedly change the pore dimensions. Accordingly, the corre- 
sponding E264D mutation of human MCU is defective in mitochondrial 
Ca** uptake!®, Each W side chain (Trp224) is oriented down such that 
it interdigitates with the E (Glu228) side chains and appears to stabi- 
lize the conformation of the selectivity filter from behind (Fig. 4b, c). 
Each W side chain also packs against the P (Pro229) residue from a 
neighbouring subunit and makes both van der Waals interactions and 
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Fig. 4 | Selectivity filter. a, Density within the selectivity filter region 

of CeMCU. The map is contoured at 80 (blue mesh) and 200 (magenta 
mesh) and shown in the context of two TM2 helices (sticks; other regions 
are omitted for clarity). Green spheres indicate Ca?* ions in sites 1 and 2; 
a grey sphere indicates site 3. b, Side view of the selectivity filter region, 
showing two subunits as ribbons and the WDXXEP signature sequence 
as sticks. Dashed lines indicate hydrogen bonds and Ca”* coordination. 
c, Close up of site 2, oriented approximately 60° from the plane of the 
membrane, showing all four subunits. d, Mitochondrial Ca** uptake by 
indicated mutants of human MCU relative to wild type (WT) when co- 
expressed with EMRE (mean + s.d., independent experiments: n = 8 for 
wild type, n=9 for M263A, and n=3 for the remainder, Extended Data 
Fig. 2d-g). Western blots demonstrate expression. —ve, untransfected cells. 


hydrogen bonds with neighbouring E (Glu228) side chains (Fig. 4c). 
The W residue is particularly sensitive to mutation; Ca?* uptake was 
not observed for any of the mutants tested (alanine, phenylalanine or 
tyrosine; Fig. 4d). 


Ca**+ binding 

Roughly spherical density is located at three sites in the upper portion 
of the pore (Fig. 4a). The sites are evenly spaced approximately 6 A 
apart and located on the symmetry axis. Densities in site 1 and site 
2 are within the selectivity filter and can be ascribed to Ca** ions on 
the basis of their chemical environments. Density in site 3, which is 
located in the upper portion of the vestibule, is harder to assign as it 
is approximately 6 A away from the surrounding Thr231 and Tyr232 
residues (measured to atom centres) and may represent Ca”* or another 
small molecule. Site 1 is positioned in the centre of the ring of Asp225 
residues. The distance between site 1 and each Asp225 carboxylate 
is approximately 4 A (between atom centres), which suggests water- 
mediated interactions with these side chains. Site 2 is located within the 
ring of Glu228 residues and only about 2.3 A from their carboxylates, 
indicating direct coordination. 

The surface of the channel facing the intermembrane space is 
strongly electronegative (Extended Data Fig. 7e). In addition to the 
ring of Asp225 residues surrounding site 1, Asp221 (Glu257 in human 
MCU) is present on each turret (Fig. 3). The preponderance of electro- 
negativity would tend to concentrate cations at the entrance to the pore. 
We envision the following scenario. A Ca”* ion reaching the pore 
would remain hydrated in site 1. As it moves inward towards site 2 it 
would shed water to directly interact with the ring of Glu228 residues, 
perhaps retaining axial water molecules above and below it. On the 
basis of this direct coordination, and as site 2 has the strongest density 
in the entire cryo-EM map (Fig. 4a), we suspect that site 2 represents 
the high-affinity binding site for Ca”* in MCU. After emerging from 
the selectivity filter, a Ca’* ion could become more hydrated in the 
vestibule (for example, in site 3). We hypothesize that ion selectivity 


© 2018 Springer Nature Limited. All rights reserved. 


Outer leaflet lipid 


Acyl density 


Fenestrations 
(membrane 
openings) 


Matrix 


Fig. 5 | Membrane interactions. a, Ordered lipids within the 
transmembrane region. Densities adacent to each subunit (blue mesh, 

80 contour) are ascribed to a lipid of the outer leaflet and an acyl 

chain within a lateral membrane opening (fenestration). The partially 
transparent molecular surface (grey) reveals a ribbon representation of 
the channel, in which each subunit is coloured uniquely. b, Close-up view, 
orthogonal to a, from within the matrix, showing densities corresponding 
to the four acyl chains within the fenestrations (site 3 is a green sphere). 

c, The vestibule is hydrophobic. A slice of the molecular surface shown 
from a (the portion facing the viewer is removed) is coloured according 


for Ca?* involves a knock-off mechanism between sites 1 and 2. In this 
mechanism, a Ca** ion moving inward from site 1 would displace the 
Ca? ion in site 2 into the vestibule, and site 1 would be refilled from 
the intermembrane space. Such a mechanism, involving multiple ions 
in single file, can allow rapid ion throughput in the context of high 
selectivity”’, 

Site 2 may underlie another characteristic of MCU. Unlike many 
Ca’* channels that efficiently conduct both Na* and K* when Ca?* 
and other divalent cations are removed, MCU efficiently conducts 
Na* but not K* under these conditions®. The dimensions of the selec- 
tivity filter at site 2 may explain this electrophysiological hallmark. 
We hypothesize that Na* could pass through site 2 when it is not occu- 
pied by Ca?* because Nat and Ca?* have nearly the same ionic radius 
(1.02 A and 1.00 A, respectively). On the other hand, K* has an ionic 
radius of 1.38 A and would be too large to fit in site 2. Correspondingly, 
the selectivity filter in K* channels is slightly wider than in MCU. In 
the K* channel KcsA”’, for example, the oxygen atoms of the selec- 
tivity filter that coordinate K* are separated by 4.7 A across the pore 
in comparison to approximately 3.9 A separation in MCU (measured 
between atom centres). This discrimination on the basis of size is an 
indication that the selectivity filter of MCU is relatively rigid at site 2. 
We postulate that site 2 preferentially binds Ca** over Na‘ because its 
+2 charge is better matched to direct coordination by the glutamate 
residues, and that site 2 would be constitutively occupied by Ca?* under 
cellular conditions. When sites 1 and 2 are occupied, the local concen- 
tration of Ca?* in the selectivity filter would be approximately 8 M 
(2 ions in a volume of approximately 4 x 10~?° 1), which is more 
than 1 million times the typical concentration of Ca?* in the cytosol. 
Selectivity for Ca** in MCU appears to be a combination of attractive, 
repulsive and excluded-volume forces. 


The N-terminal domains 

Each individual NTD of CeMCU adopts the same structure (Ca root 
mean square deviation (r.m.s.d.) =0.5 A), and superposes with X-ray 
structures of isolated NTDs of human MCU (Ca rm.s.d.=2 A)*%3!, 
despite 16% sequence identity (Extended Data Fig. 9a). Inspection of 
the crystal lattice from the X-ray structures reveals an arrangement 
of NTDs that is similar to the crescent we observe in the structure of 
the full-length zebrafish channel (Extended Data Fig. 9b, c). This cor- 
respondence suggests that the NTDs of human MCU adopt a similar 
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to electrostatic potential: light grey regions are neutral; red, —8 kT e'; 
blue, +8 kT e~!. Green spheres indicate sites 1-3. One subunit is 

shown as yellow ribbons with amino acids facing the vestibule drawn 

as sticks: Ile198, Ala199, Gly202, Leu206 and Tyr209 from TM1 and 
Tyr232, Gly235, Leu236, Thr238, Leu239, Tyr243 and Phe246 from TM2; 
the corresponding amino acids in human MCU are: Val235, Leu236, 
Gly239, Met243, Gln246, Tyr268, Thr271, Tyr272, Ser274, Ala275, Tyr279 
and Phe282, respectively. Basic residues at the matrix boundary of the 
membrane (for example, Arg193, Arg197 and Arg251 of CCMCU) may 
interact with phospholipid headgroups. 


organization in the assembled channel. We find that, like human 
MCu'®”®, zebrafish MCU can catalyse Ca** uptake without an NTD 
(Extended Data Fig. 2a-c). Nevertheless, it is possible that the NTDs 
have regulatory properties, which may include modulation by matrix 
cations*! and binding of MCUR1*”. 


Membrane interactions 

Densities ascribed (on the basis of shape and position) to four lipid 
molecules per channel are located on the membrane-exposed surfaces 
adjacent to the TM2 helices (Fig. 5a). Although the type of lipid cannot 
be discerned from the density, the lipids would belong to the outer leaf- 
let and are oriented in a conventional manner with their head groups 
facing the intermembrane space. 

Within the inner leaflet, TM1 and TM2 of the same subunit are sep- 
arated by a few Angstroms and this creates lateral membrane openings 
(‘fenestrations’) in the transmembrane molecular surface that would 
expose the vestibule to the membrane (Fig. 5a). Four horizontally 
oriented tubes of density consistent with acyl chains of lipid molecules 
are observed within the fenestrations just below site 3 (Fig. 5a, b). The 
densities are positioned similarly to acyl chains observed in fenestra- 
tions of certain other cation channels, including two-pore domain 
potassium channels and a voltage-dependent sodium channel*!*?-**, 
A minor conformational change in the transmembrane helices, such 
as a hinge-like motion at or near Pro299 or Gly235 on TM2 (Fig. 3), 
could seal the fenestrations, raising hypotheses regarding possible ion 
permeation and gating mechanisms and the potential involvement of 
lipids in them. 


Discussion 

The presence of lipids in the vestibule might prevent ion permeation, 
and thus a wider lower portion of the channel, as observed in the 
structure of CeMCU, may represent a non-conductive (‘closed’) con- 
formation of the channel. On the other hand, with its wide vestibule, 
the structure may represent a conductive conformation. If this is the 
case, perhaps lipids are involved in the inward rectification exhibited 
by MCU, wherein the channel preferentially catalyses the flow of Ca?* 
towards the matrix®. In this mechanism, Ca” ions emerging from the 
selectivity filter into site 3 would be pulled inward by the voltage gra- 
dient across the membrane and could transiently displace lipids along 
their path through the vestibule. Ca?* ions would not have sufficient 
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potential energy to efficiently pass through the lipids without the neg- 
ative voltage of the matrix. In the absence of inward ion flow, the vesti- 
bule could fill with lipids and MCU would present itself to the matrix as 
collection of transmembrane helices with intervening lipids rather than 
an aqueous pore. These lipids ‘covering up the selectivity filter would 
tend to prevent outward ion flow and this would be manifest to the 
experimentalist as inward rectification when applying a positive voltage 
inside the matrix. The legs of the channel appear suited to allow lipid 
movement. Entire lipid molecules of the inner leaflet, along with their 
head groups, could diffuse in and out of the vestibule between the legs. 
In line with this hypothesis, the surfaces of TM1 and TM2 lining the 
vestibule are hydrophobic and basic residues on the legs appear poised 
to interact with phospholipid head groups (Fig. 5c). Analogous residues 
are present in human MCU. The structure establishes a foundation to 
probe lipid involvement. 

The structures of zebrafish MCU and CeMCU reveal a new architec- 
ture for a Ca” channel. The pore of MCU differs from the large super- 
family of cation channels that share a pore architecture that was first 
observed for the K+ channel KcsA*°. Unlike those channels, which have 
loop-like secondary structures in their selectivity filters, the selectivity 
filter of MCU is formed by a-helices that coordinate Ca** with rings of 
acidic amino acids. The structures suggest that MCU combines a multi- 
ion selectivity filter, a discriminatory Ca”* binding site, an electroneg- 
ative surface exposed to the intermembrane space, and a vestibule to 
lower energy barriers for Ca”* influx into mitochondria and achieve high 
selectivity for Ca”. They also indicate potential sites of channel regu- 
lation, not only by EMRE, MICU1 and MICU2 in the membrane and 
intermembrane space but also by other functionalities within the matrix. 
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Any Methods, including any statements of data availability and Nature Research 
reporting summaries, along with any additional references and Source Data files, 
are available in the online version of the paper at https://doi.org/10.1038/s41586- 
018-0331-8. 


Received: 4 March 2018; Accepted: 19 June 2018; 
Published online 11 July 2018. 


1. Deluca, H. F. & Engstrom, G. W. Calcium uptake by rat kidney mitochondria. 
Proc. Natl Acad. Sci. USA 47, 1744-1750 (1961). 
2. ontero, M. et al. Chromaffin-cell stimulation triggers fast millimolar 
mitochondrial Ca2* transients that modulate secretion. Nat. Cell Biol. 2, 57-61 
(2000). 
3. Baughman, J. M. et al. Integrative genomics identifies MCU as an essential 
component of the mitochondrial calcium uniporter. Nature 476, 341-345 
(2011). 
4. De Stefani, D., Raffaello, A., Teardo, E., Szabé, |. & Rizzuto, R. A forty-kilodalton 
protein of the inner membrane is the mitochondrial calcium uniporter. Nature 
476, 336-340 (2011). 

5. Chaudhuri, D., Sancak, Y., Mootha, V. K. & Clapham, D. E. MCU encodes the pore 
conducting mitochondrial calcium currents. eLife 2, e00704 (2013). 

6. Kamer, K. J. & Mootha, V. K. The molecular era of the mitochondrial calcium 
uniporter. Nat. Rev. Mol. Cell Biol. 16, 545-553 (2015). 

7. Gunter, K. K. & Gunter, T. E. Transport of calcium by mitochondria. J. Bioenerg. 
Biomembr. 26, 471-485 (1994). 

8. Kirichok, Y., Krapivinsky, G. & Clapham, D. E. The mitochondrial calcium 
uniporter is a highly selective ion channel. Nature 427, 360-364 (2004). 

9. Bick, A. G., Calvo, S. E. & Mootha, V. K. Evolutionary diversity of the 

mitochondrial calcium uniporter. Science 336, 886 (2012). 

0. Sancak, Y. et al. EMRE is an essential component of the mitochondrial calcium 

uniporter complex. Science 342, 1379-1382 (2013). 

1. Perocchi, F. et al. MICU1 encodes a mitochondrial EF hand protein required for 

Ca?* uptake. Nature 467, 291-296 (2010). 

2. Plovanich, M. et al. MICU2, a paralog of MICU1, resides within the 

mitochondrial uniporter complex to regulate calcium handling. PLoS One 8, 

e55785 (2013). 

3. Mallilankaraman, K. et al. MCUR1 is an essential component of mitochondrial 
Ca?+ uptake that regulates cellular metabolism. Nat. Cell Biol. 14, 1336-1343 
(2012). 

4. Vais, H. et al. MCUR1, CCDC90A, is a regulator of the mitochondrial calcium 
uniporter. Cell Metab. 22, 533-535 (2015). 


584 | NATURE | VOL 559 | 26 JULY 2018 


5. Chaudhuri, D., Artiga, D. J., Abiria, S. A. & Clapham, D. E. Mitochondrial calcium 
uniporter regulator 1 (MCUR1) regulates the calcium threshold for the 
mitochondrial permeability transition. Proc. Nat! Acad. Sci. USA 113, 
E1872-E1880 (2016). 

6. Kovacs-Bogdan, E. et al. Reconstitution of the mitochondrial calcium uniporter 
in yeast. Proc. Natl Acad. Sci. USA 111, 8985-8990 (2014). 

7. Tsai, M.-F. et al. Dual functions of a small regulatory subunit in the 
mitochondrial calcium uniporter complex. eLife 5,e15545 (2016). 

8. Oxenoid, K. et al. Architecture of the mitochondrial calcium uniporter. Nature 
533, 269-273 (2016). 

9. Moraes, |., Evans, G., Sanchez-Weatherby, J., Newstead, S. & Stewart, P. D. 
Membrane protein structure determination — the next generation. Biochim. 
Biophys. Acta 1838 (1 Pt A), 78-87 (2014). 

20. Long, S. B., Tao, X., Campbell, E. B. & MacKinnon, R. Atomic structure of a 
voltage-dependent Kt channel in a lipid membrane-like environment. Nature 
450, 376-382 (2007). 

21. Payandeh, J., Scheuer, T., Zheng, N. & Catterall, W. A. The crystal structure of a 
voltage-gated sodium channel. Nature 475, 353-358 (2011). 

22. Wu, J. et al. Structure of the voltage-gated calcium channel Cav1.1 complex. 
Science 350, aad2395 (2015). 

23. Shen, H. et al. Structure of a eukaryotic voltage-gated sodium channel at 
near-atomic resolution. Science 355, eaal4326 (2017). 

24. Yan, Z. et al. Structure of the rabbit ryanodine receptor RyR1 at near-atomic 
resolution. Nature 517, 50-55 (2015). 

25. Zalk, R. et al. Structure of a mammalian ryanodine receptor. Nature 517, 44-49 
(2015). 

26. Fan, G. et al. Gating machinery of InsP3R channels revealed by electron 
cryomicroscopy. Nature 527, 336-341 (2015). 

27. Liao, M., Cao, E., Julius, D. & Cheng, Y. Structure of the TRPV1 ion channel 
determined by electron cryo-microscopy. Nature 504, 107-112 (2013). 

28. Hille, B. lonic Channels of Excitable Membranes 3rd edn (Sinauer Associates, 
Sunderland, 2001). 

29. Zhou, Y., Morais-Cabral, J. H., Kaufman, A. & MacKinnon, R. Chemistry of ion 
coordination and hydration revealed by a Kt channel-Fab complex at 2.0 A 
resolution. Nature 414, 43-48 (2001). 

30. Lee, Y. et al. Structure and function of the N-terminal domain of the human 

mitochondrial calcium uniporter. EMBO Rep. 16, 1318-1333 (2015). 

31. Lee, S. K. et al. Structural insights into mitochondrial calcium uniporter 

regulation by divalent cations. Cell Chem. Biol. 23, 1157-1169 (2016). 

32. Miller, A. N. & Long, S. B. Crystal structure of the human two-pore domain 

potassium channel K2P1. Science 335, 432-436 (2012). 

33. Brohawn, S. G., Campbell, E. B. & MacKinnon, R. Physical mechanism for gating 

and mechanosensitivity of the human TRAAK Kt channel. Nature 516, 126-130 

(2014). 

34. Dong, Y. Y. et al. K2P channel gating mechanisms revealed by structures of 

TREK-2 and a complex with Prozac. Science 347, 1256-1259 (2015). 

35. Doyle, D.A. et al. The structure of the potassium channel: molecular basis of Kt 

conduction and selectivity. Science 280, 69-77 (1998). 


Acknowledgements We thank N. Grigorieff, members of his laboratory, and 

the staff at the Howard Hughes Medical Institute Cryo-EM facility for training 

in cryo-EM; R. K. Hite and members of the Long laboratory for discussions; the 
staff of the New York Structural Biology Center Simons Electron Microscopy 
Center, M. Ebrahim, and M. J. de la Cruz of the Memorial Sloan Kettering 
Cancer Center Cryo-EM facility for help with data collection; and J. Goldberg 

for spectrophotometer use. This work was supported, in part, by an NIH core 
facilities grant to Memorial Sloan Kettering Cancer Center (P30 CA008748), by 
an NIH Medical Scientist Training Program grant (T32GM007739 for A.F.S.), and 
by an NIH grant (RO1GM094273 to S.B.L.). 


Reviewer information Nature thanks P. Yuan and the other anonymous 
reviewer(s) for their contribution to the peer review of this work. 


Author contributions R.B. and C.W. performed cryo-EM studies of CeMCU and 
zebrafish MCU, respectively, and other experiments. A.F.S. developed MCU and 
EMRE double-knockout cells. S.B.L. directed the research and assisted with 
cryo-EM. All authors contributed to data analysis and the preparation of the 
manuscript. 


Competing interests The authors declare no competing interests. 


Additional information 

Extended data is available for this paper at https://doi.org/10.1038/s41586- 
018-0331-8. 

Supplementary information is available for this paper at https://doi.org/ 
10.1038/s41586-018-0331-8. 

Reprints and permissions information is available at http://www.nature.com/ 
reprints. 

Correspondence and requests for materials should be addressed to S.B.L. 
Publisher’s note: Springer Nature remains neutral with regard to jurisdictional 
claims in published maps and institutional affiliations. 


© 2018 Springer Nature Limited. All rights reserved. 


METHODS 

Cloning, expression and purification of CeMCU. Cyphellophora europaea MCU 
(CeMCU; UniProt accession W2SDE2) was selected as a candidate for structural 
studies from among numerous MCU orthologues that were evaluated using 
fluorescence-detection size-exclusion chromatography (FSEC) with HEK-293 
cells*®. The cDNA of CeMCU (synthesized by BioBasic) was ligated into the 
EcoRI-Sall restriction sites of the Pichia pastoris expression vector pPICZ-C 
(Invitrogen Life Technologies), encoding the full-length protein followed by a 
C-terminal antibody-affinity tag (GEGEEF) that is recognized by the anti-tubulin 
antibody YL1/23”. Transformation into P pastoris (SMD1168 strain, Invitrogen), 
expression and cryo-lysis were performed as previously described”. 

Mild detergents (n-dodecyl-6-p-maltoside and digitonin) that are commonly 
used for structural studies of membrane proteins’? were used for preparation of 
CeMCU and zebrafish MCU. All purification steps were carried out at 4°C. For 
CeMCU, lysed P. pastoris cells (~150 g) were re-suspended in 1.5 L of RB buffer 
containing 20 mM 4-(2-hydroxyethyl)-1-piperazineethanesulfonic acid (HEPES), 
pH7.5, 150mM NaCl, 1 mM CaCl, 0.15 mg/mL DNase I (Sigma-Aldrich), 1.5 pg/mL 
Leupeptin (Sigma-Aldrich), 1.5 jug/mL Pepstatin A (Sigma-Aldrich), 1 mM 
Benzamidine (Sigma-Aldrich), 1 mM 4-(2-aminoethyl) benzenesulfony] fluo- 
ride hydrochloride (AEBSF, Gold Biotechnology), 1 mM phenylmethanesulfonyl 
fluoride (PMSE, Acros Organics) and 1:500 dilution of Aprotonin (Sigma-Aldrich). 
Cell debris was removed by centrifugation (3 x 7000g for 20 min), and cellular 
membranes were collected by centrifugation at 43,000g for 20 min. The pelleted 
membranes were resuspended in 100 mL of RB buffer supplemented with 20% (v/v) 
glycerol, flash-frozen in liquid nitrogen, stored at —80°C. Membranes (~33 mL, 
corresponding to ~40 g of lysed cells) were thawed, diluted with equal volume of 
RB buffer, and solubilized with 2 g of n-dodecyl-6-maltoside (DDM, Anatrace) 
by rotating for 1 h at 4°C. The sample was then centrifuged at 43,000g for 45 min 
and the supernatant was filtered through a 0.22-1m polystyrene membrane 
(Millipore). YL1/2 antibody (IgG, expressed from hybridoma cells and purified 
by ion-exchange chromatography using standard methods) was coupled to CNBr- 
activated Sepharose beads (GE Healthcare) according to manufacturer’s protocol. 
Approximately 0.4 mL of YL1/2 antibody beads were added to the sample for each 
1 g of P. pastoris cells and the mixture was rotated at 4°C for 1 h. Antibody beads 
were collected on a column support, washed with ~5 column volumes of a buffer 
containing 20 mM HEPES, pH 7.5, 150 mM NaCl, 1 mM CaCl, 2 mM DDM and 
0.1 mg/mL 1',3’-bis[1,2-dioleoyl-sn-glycero-3-phospho]-sn-glycerol (18:1 cardi- 
olipin, Avanti Polar Lipids), and the protein was eluted with a buffer containing 
100 mM HEPES, pH 7.5, 150 mM NaCl, 1 mM CaCl, 2 mM DDM, 0.1 mg/mL 
cardiolipin and 1 mM Glu-Glu-Phe peptide (Peptide 2.0). The elution fraction was 
concentrated to ~5 mg/mL using a 100 kDa concentrator (Amicon Ultra; EMD 
Millipore) and run on a Superose-6 10/300 Increase size-exclusion chromatography 
(SEC) GL column, equilibrated with a buffer containing 20 mM HEPES pH 7.5, 
150 mM NaCl, 1 mM CaCh, 0.5 mM digitonin (Cayman Chemical Company) and 
0.05 mg/mL cardiolipin. (Digitonin was used at the end of the purification proce- 
dure because it produced more uniform ice on the cryo-EM grids than DDM.) The 
elution fractions containing the CeMCU protein were pooled and concentrated to 
~5 mg/mL using a 100 kDa concentrator (Vivaspin-2). Purified protein was used 
immediately for cryo-Electron Microscopy (cryo-EM) grid preparation. 
Cloning, expression and purification of zebrafish MCU. Danio rerio (zebrafish) 
MCU was selected as a candidate for structural studies on the basis of good protein 
expression in HEK293S GnTI ~ cells (ATCC, not tested for mycoplasma contam- 
ination), the ability to catalyse mitochondrial calcium uptake when co-expressed 
with EMRE, and good biochemical stability in detergent-containing solutions. The 
cDNA of zebrafish MCU (UniProt accession zgc:153607, synthesized by Synbio 
Tech Inc.) was ligated into the XhoI-EcoRI restriction sites of a BacMam expres- 
sion vector’, encoding the full-length protein followed by a C-terminal 1D4 tag 
(TETSQVAPA) that is recognized by the Rho-1D4 monoclonal antibody*®. The 
resulting plasmid was transformed to DH10Bac E. coli competent cells to generate 
bacmid. Recombinant P1 baculoviruses were generated in Sf9 cells (Invitrogen, 
not tested for mycoplasma contamination) cultured in sf-900 II] SFM medium 
(GIBCO) using cellfectin II reagents (Invitrogen). For protein expression, 100 mL 
P3 virus was used to infect 1 L culture of HEK293S GnTI ~ cells (3 x 10° cells/mL) 
in Freestyle 293 media (GIBCO) supplemented with 2% FBS (GIBCO, catalogue 
A31606-02). After infection, the cells were incubated at 37 °C for 16 h, and then 5 
mM sodium butyrate (Sigma-Aldrich) was added to the cell culture and cells were 
cultured at 30°C for another 72 h before harvest. 

2 L of cell pellet was resuspended in 100 mL lysis buffer (40 mM HEPES pH 7.4 
150 mM NaCl, 5 (tM CaCl, 0.15 mg/ml DNase I, 1.5 jxg/ml Leupeptin, 1.5 jxg/ml 
Pepstatin A, 1 mM AEBSF, 1 mM Benzamidine, 1 mM PMSF and 1:500 dilution 
of Aprotinin), and then solubilized by adding lauryl maltose neopentyl glycol 
(LMNG, Anatrace, from a 10% stock in water) to a final concentration of 1% and 
stirred at 4°C for 1.5 h. Detergent-solubilized material was separated from the 
insoluble fraction by centrifugation at 43,000g for 1 h and the supernatant was 
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filtered through a 0.22-|1m polystyrene membrane (Millipore). Rho-1D4 mono- 
clonal antibody was coupled to CNBR-activated Sepharose beads (GE healthcare) 
according to the manufacturer’s protocol. 5 mL Rho-1D4 monoclonal antibody 
beads were added to the sample and rotated at 4°C for 2 h. The beads were washed 
with 100 mL of buffer A (20 mM HEPES pH 7.4 150 mM NaCl, 5 pM CaCh, 0.5 mM 
digitonin and 0.01 mg/ml 18:1 cardiolipin). Zebrafish MCU protein was eluted by 
incubating with buffer A containing 2 mg/ml 1D4 peptide (TETSQVAPA, syn- 
thesized by Peptide 2.0) overnight at 4°C, and further purified by size-exclusion 
chromatography on a Superose 6 increase, 10/300 GL column (GE Healthcare) 
equilibrated with buffer B (20 mM Tris(hydroxymethyl)-aminomethane (Tris) 
pH 8.5, 150 mM NaCl, 5 tM CaCh, 0.5 mM digitonin, 0.01 mg/ml cardiolipin and 
0.4 mM Tris(2-carboxyethyl)phosphine). The peak fractions were pooled, concen- 
trated to 5 mg/ml using a 100 kDa concentrator (Vivaspin-2), and immediately 
used for cryo-EM grid preparation. 
EM sample preparation and data acquisition. 5 |1L of 5 mg/mL freshly purified 
CeMCU and zebrafish MCU protein samples were loaded into glow-discharged 
(10 s) Quantifoil R1.2/1.3 holey carbon grids (Au 400, Electron Microscopy 
Sciences) and plunge-frozen in liquid nitrogen-cooled liquid ethane, using a 
Vitrobot Mark IV (FEI) operated at room temperature with a blotting time of 
2s under blot force of 0 at 100% humidity. Grids were clipped and loaded into a 
300 keV Titan Krios microscope (FEI) equipped with a Gatan K2 Summit direct 
electron detector (Gatan). Grids were screened on the bases of particle distribution 
and density, and images from the best ones were recorded. The CEMCU micro- 
graphs were collected at a magnification of 29,000 x with a super-resolution pixel 
size of 0.4307 A and a defocus range of —0.9 to —2.9 jm. The dose rate was 8 
electrons per physical pixel per second, and images were recorded for 8 s with 0.2 s 
subframes (40 total frames) corresponding to a total dose of approximately 80 elec- 
trons per A”. The micrographs for zebrafish MCU were collected at a magnification 
of 37,000 x with a super-resolution pixel size of 0.346 A and a defocus range of 
—1.2 to —3.2 jum. The dose rate was 8 electrons per physical pixel per second and 
images were recorded for 6 s with 0.13 s subframes (46 total frames) corresponding 
to a total dose of approximately 100 electrons per A”. 
Image processing of CEMCU. Extended Data Figure 3 shows the cryo-EM work- 
flow of CeMCU. Movie stacks were gain-corrected and twofold Fourier cropped to 
a calibrated pixel size of 0.8613 A, and motion corrected and dose weighted using 
MotionCor2*!. Contrast Transfer Function (CTF) estimates for motion-corrected 
micrographs were performed in CTFFIND4 using frames 1-40**. Subsequent 
image processing was carried out with RELION 2.1”, using a particle box size of 
384 pixels and a mask diameter of 160 A, unless stated otherwise. The initial data 
set (~3700 movies) was obtained from two equivalent data collection sessions. 
Each micrograph was inspected manually; poor-quality ones and those having 
CTF estimation fits to lower than 5 A were discarded. Approximately 2000 particles 
were picked manually and these were used for reference-free 2D classification to 
generate templates for automatic particle picking. Auto-picking (using 8 of the rep- 
resentative 2D classes containing different orientations as templates) from ~3,700 
micrographs yielded 240,516 particles, which were further cleaned-up by remov- 
ing outlier particles using two rounds of reference-free 2D classification with 100 
classes (for example, to remove contaminants or empty micelles), and resulted in 
201,281 particles. An initial 3D model from these particles was generated without 
imposed symmetry (C1 symmetry) with RELION 2.1. Using this initial model, 
3D classification with C1 symmetry was performed, sorting the particles into 10 
classes. The 10 classes had similar shapes and all indicated a tetrameric archi- 
tecture. By visual inspection in UCSF Chimera“, 8 “good” classes (representing 
171,407 particles) were combined and subjected to 3D refinement. This yielded 
a reconstruction at 4.8 A overall resolution in Cl symmetry. “Polished” particle 
images, which are corrected for individual particle movements, were generated in 
Relion using aligned movie frames that were output from MotionCor2. 3D refine- 
ment using the 171,407 polished particles and a mask that removes the micelle 
region yielded a reconstruction to 4.5 A overall resolution with C1 symmetry. Well- 
defined density for a-helices enabled determination of the appropriate hand for the 
reconstruction at this point (Extended Data Fig. 3). The C2 overall symmetry of 
the molecule was evident from this map: it has fourfold symmetry within the TMD 
and twofold symmetry within the matrix region (Fig. la). Refinement using C2 
symmetry in RELION 2.1 improved the overall resolution to 4.1 A. All resolution 
estimates are based on gold-standard Fourier shell correlation (FSC) calculations. 
To improve the reconstruction further, an additional ~2340 micrographs were 
collected using the same microscope and the same data collection strategy. These 
were motion-corrected using MotionCor2 as indicated above. Auto-picked par- 
ticles were subjected to one round of reference-free 2D classification and merged 
with the previous data set at the “un-polished” stage to yield a total of 547,637 
particles. 3D refinement in RELION 2.1 was performed using all of these particles 
(with C2 symmetry) and the particles were “polished” together as a single data set 
using methods described above. The polished particles were then subjected to 
3D classification, with C2 symmetry, sorting them into 10 classes. 8 good classes 
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containing 376,541 polished particles were combined and subjected to 3D refine- 
ment in RELION 2.1 but this did not improve the overall resolution beyond 4.1 A. 
All of these particles were imported into the cisTEM cryo-EM software package for 
further refinement’. 3D refinement in cisTEM, using the previously used mask, 
but where the micelle region outside the mask was low-pass filtered at 15 A rather 
than removed, yielded the final reconstruction at 3.2 A overall resolution. We 
suspect that this low-pass filter and the weighting scheme used for reconstruction 
in cisTEM were the primary reasons for the improved resolution*’. The map was 
B-factor sharpened using a B-factor of —90 A? and low-pass filtered at 3.2 A res- 
olution (using cisTEM). Independent half-maps were generated and only one of 
them was used to build and refine the atomic model; the other was used only for 
validation. We note that the additional data set was important to extend the reso- 
lution of the reconstruction; refinement in cisTEM using only the initial 171,407 
polished particles yielded a reconstruction to 3.8 A overall resolution. Estimation 
of the local resolution of the map was done using blocres*®. The maps of COoMCU 
shown in Figures are combined maps, which were sharpened (using a B-factor of 
—90 A?) and low-pass filtered at 3.2 A using cisTEM. 

Model building and refinement. The atomic model of CE€MCU was manually 
built into one of the half-maps (sharpened using a B-factor of —90 A? and low-pass 
filtered at 3.2 A resolution) and refined in real space using the COOT software’. 
The crystal structure of the NTD from human MCU (PDB: 5KUI) was used as a 
reference for the corresponding regions. Further refinement of the atomic model 
was carried out in PHENIX“* using real-space refinement against the same half- 
map. The final model has good stereochemistry and good Fourier shell correlation 
with the other half-map as well as the combined map (Extended Data Fig. 6e and 
Extended Data Table 1). Structural Figures were prepared with Pymol (pymol. 
org), Chimera**, and HOLE”. Electrostatic calculations used the APBS*° plugin 
in Pymol. 

Image processing of zebrafish MCU. Extended Data Figure 4 shows the cryo-EM 
workflow for zebrafish MCU. Movie stacks were gain-corrected, twofold Fourier 
cropped to a pixel size of 0.692 A, motion corrected, and dose weighted using 
MotionCor2?. Contrast transfer function (CTF) estimates were performed in 
CTFFIND4 using frames 1-46*. Subsequent image processing was carried out in 
RELION 2.1°. Generating an improved initial model was crucial for 3D refine- 
ment of zebrafish MCU (Extended Data Fig. 4a). The improved initial model 
was generated using a subset of particles but refinement (Extended Data Fig. 4b) 
was done using the majority of particles as described below. Approximately 3000 
particles were manually picked and subjected to 2D classification. Nine 2D-class 
averages were selected as templates for autopicking of 269,574 particles from ~3500 
micrographs in RELION 2.1. After 2D classification to remove a few bad classes, 
203,213 particles were used for ab-initio 3D model generation (with 4 classes) in 
cryoSPARC*. The best 3D reconstruction and the associated particle images were 
used for further 3D classification in RELION 2.1, sorting the particles into 6 classes. 
Three “good” classes (91,727 particles) were combined and subjected to further 3D 
classification (using 6 classes). The best class (representing 15,775 particles) was 
improved by 3D refinement in RELION 2.1 and yielded the improved initial model 
(Extended Data Fig. 4a). At this point, particles from micrographs with CTF fits 
better than 5 A were selected (196,272 particles) and used for homogeneous refine- 
ment in cryoSPARC using the improved initial model (Extended Data Fig. 4b). 
This yielded a reconstruction at 9.1 A overall resolution. All 196,272 particles 
were imported into cisTEM for further refinement®. 3D refinement in cisTEM, 
using a mask to apply a 30 A low-pass filter to the detergent micelle, improved the 
overall resolution to 8.5 A. The map shown in Fig. 2 was B-factor sharpened using 
a B-factor of —90 A? and low-pass filtered at 8.5 A resolution (using cisTEM). 
Crosslinking. For crosslinking of purified CeMCU in detergent, the protein was 
diluted to 0.2 mg/mL in buffer consisting of 20 mM HEPES, pH7.5, 150 mM NaCl, 
1mM CaCh, 1 mM DDM and 0.05 mg/mL cardiolipin. 0 to 1 mM concentrations of 
bis(sulfosuccinimidyl) suberate crosslinker (BS-3, Sigma-Aldrich, from a 17.5 mM 
stock in water) were added and the samples were incubated for 30 min at room 
temperature. The reactions were stopped by the addition of 140 mM Tris, pH 8.5, 
and analysed by 4-12% SDS-polyacrylamide gel electrophoresis under reducing 
conditions (100 mM dithiothreitol) using coomassie staining. For crosslinking 
of purified zebrafish MCU in detergent, the protein was diluted to 0.5 mg/mL in 
buffer consisting of 20 mM HEPES, pH7.5, 150 mM NaCl, 1 mM CaCh, 0.5 mM 
digitonin and 0.01 mg/mL cardiolipin. 0 to 1 mM concentrations of BS-3 
crosslinker were added and the samples were incubated for 60 min on ice. The 
reactions were stopped by the addition of 100 mM Tris, pH 8.5, and analysed by 
12% SDS-polyacrylamide gel electrophoresis under reducing conditions (100 mM 
dithiothreitol) using coomassie staining. 

For crosslinking of human (Hs) MCU in MCU/EMRE knockout cell mem- 
branes, a chimaeric protein, consisting of the mitochondrial targeting sequence 
from D. discoideum MCU, fused to the mature form of human MCU (DdMTS- 
HsMCU) as previously used", followed by cDNA encoding a C-terminal Rho-1D4 
affinity tag (amino acids: TETSQVAPA) and a stop codon, was cloned into the 


XhoI-EcoRI restriction sites of expression vector pCGFP-EU™, transfected into 
MCU/EMRE knockout cells using Lipofectamine 3000 (Invitrogen), and the 
cells were cultured for 48 h at 37°C with 5% COs. Cells were harvested and re- 
suspended in a buffer containing 50 mM HEPES, pH 7.5, 200 mM NaCl, 1 mM 
CaCl, and 1:500 dilution of Protease Inhibitor Cocktail II], EDTA-free 
(CalBiochem), and lysed by dounce homogenization and brief sonication (30 s) 
using a bath sonicator. Cell debris was removed by centrifugation (2500g, 4°C, 
15 min) and the membrane fractions were collected by centrifugation (100,000g, 
4°C, 1h). The membrane fraction pellets were re-suspended in the same buffer 
using a dounce homogenizer followed by brief sonication (30 s), and treated with 
0-575 \M of the membrane permeable crosslinker disuccinimidy] glutarate (DSG, 
Pierce, from a 30 mM stock in DMSO) for 30 min at room temperature. The 
reactions were stopped by the addition of 140 mM Tris, pH 8.5, run on a 4-12% 
SDS-polyacrylamide gel under the reducing conditions (100 mM dithiothreitol), 
and detected by western blot using the Rho-1D4 antibody”. 

Generation of MCU/EMRE knockout HEK293T cells. SMDT1 and MCU, which 
encode human EMRE and MCU, respectively, were simultaneously targeted in 
HEK293T cells (Invitrogen, not tested for mycoplasma contamination) following 
a well-established CRISPR-Cas9 protocol’. In brief, sgRNA sequences target- 
ing coding regions of SMDT1 and MCU were selected using an online design 
tool (http://crispr.mit.edu/). Oligonucleotides incorporating target sequences 
for SMDT1 (5’-caccgATGGCGATGTCTCCGCCGCA-3’, 5’-aaacTGCGGCGG 
AGACATCGCCATc-3’) and MCU (5'/-caccgTGACAGCGTTCACGCCGG 
GA-3’, 5’-aaacTCCCGGCGTGAACGCTGTCAc-3’) were synthesized by IDT, 
annealed and cloned into the pSpCas9(BB)-2A-GFP vector (Addgene plasmid 
# 48138). Correctly inserted sgRNA sequences were verified for the SMDT1- 
and MCU-targeting vectors by Sanger sequencing (Genewiz). HEK293T cells 
were detached with trypsin and seeded into a 24-well plate with approximately 
1.3 x 10° cells per well. Twenty-four hours after seeding, cells were transfected 
with 0.25 1g of each vector using Lipofectamine2000 (Invitrogen). Twenty-four 
hours post-transfection, cells were detached and single GFP-positive cells were 
sorted into individual wells of a 96-well plate containing DMEM (supplemented 
with 20% FBS, 100 U/ml penicillin, and 100 ,»g/ml streptomycin) using a BD 
FACSAria II cell sorter. After sorting, cells were incubated for 2 weeks, during 
which time media was changed every 3-5 days, and wells were monitored for 
the appearance of well-defined single colonies. Colonies were then transferred 
to a 24-well plate for subsequent outgrowth, DNA analysis, and functional 
characterization. Genomic DNA from individual colonies was prepared using 
QuickExtract solution according to the manufacturer's instructions. Loci con- 
taining the targeted sequences were PCR amplified and Sanger sequenced using 
the following primers: 5’-TAACGAAT TCCACAGATAGCACGGGCCTTT-3’, 
5'-TATGGTCGACTAGCGTATTCTTGGTCCGCC-3’ for SMDT1 and 
5/-TAACGAATTCACCCCCATGCAAGAATGGTA-3’, 5’-TATGGTCGACGTT 
CATCCTTGCTCATGGCATT-3’ for MCU. Putative double knockouts that were 
disrupted at both SMDT1 and MCU loci (as judged by Sanger sequencing) were 
verified functionally using the mitochondrial Ca”* uptake assay. 
Mitochondrial Ca?* uptake measurements. The cDNA for zebrafish MCU, 
zebrafish MCUanrtp (deletion from residue 85 to residue 190), zebrafish EMRE 
(UniProt accession: QO8BI9), DAMTS—-HsMCU chimaera, and human EMRE 
(UniProt accession: Q9H4I9) were sub-cloned into the pCGFP-EU mammalian 
expression vector with a C-terminal 1D4 tag (and without GFP)**. Mutations were 
generated using Q5 Site-Directed Mutagenesis Kit (New England BioLabs Inc.) and 
verified by DNA sequencing. 

1.5 jug MCU plasmid and 1.5 jpg EMRE plasmid were co-transfected into 
~2 x 10° MCU/EMRE knockout cells using Lipofectamine2000 (Invitrogen). 
After transfection, the cells were cultured at 37 °C for ~48 h and then collected 
for calcium uptake assays and western blotting. 

Calcium uptake assays of permeabilized MCU/EMRE knockout cells were per- 
formed according to a published protocol!®. In brief, ~2 x 10° cells were resus- 
pended in 1 mL of buffer containing 125 mM KCl, 2 mM K,HPO,, 1 mM MgCh, 
10 1M ethylene glycol-bis(2-aminoethylether)-N,N,N‘,N’-tetraacetic acid (EGTA, 
Gold Biotechnology), 5 mM malate, 5 mM glutamate, 5 mM succinate (Sigma- 
Aldrich) and 20 mM HEPES, pH 7.4, and loaded into a stirred quartz cuvette in a 
Hitachi F-2500 spectrophotometer (excitation: 506 nm, slit length: 3 nm, emission: 
531 nm, slit length: 3 nm, sampling frequency: 2 Hz), with the temperature main- 
tained at 22°C by a circulating water bath. In a typical experiment, reagents were 
added into the cuvette in the following order: 1 1M cell-impermeable Calcium 
Green-5N (Life Technologies, from a 1 mM stock solution in water), 0.005% (w/v) 
digitonin (Sigma-Aldrich, from a 0.5% stock solution in water), 5 ,tM CaCl, (from 
a1 mM stock solution in water), and either 1 |1M ruthenium red (Sigma-Aldrich, 
from a 0.2 mM stock solution in water), 5 |tM carbonyl cyanide m-chlorophenyl 
hydrazone (CCCP, Sigma-Aldrich, from a 1 mM stock solution in DMSO) or 10 1M 
ETH129 (Calcium ionophore II, Sigma-Aldrich, from a 2 mM stock solution in 
DMSO). For the data plotted in Fig. 4d, a rate of Ca* uptake was defined as the 
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slope of a linear fit of the Ca** uptake curve between 240 and 260 s (for exam- 
ple, Extended Data Fig. 2d, f). Relative uptake (Fig. 4d) is the ratio of the slopes 
observed for mutant and wild type MCU channels. 

Reporting summary. Further information on experimental design is available in 
the Nature Research Reporting Summary linked to this paper. 

Data availability. Atomic coordinates of CEMCU have been deposited in the 
PDB under accession 6DNF. The three-dimensional cryo-EM density maps have 
been deposited in Electron Microscopy Data Bank under accessions EMD-7971 
(CeMCU) and EMD-7972 (zebrafish MCU). 
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Mitochondria-targeting sequence 
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Extended Data Fig. 1 | Structure-based sequence alignment. The amino 
acid sequences of C. europaea, Pyronema omphalodes, D. discoideum, 

C. elegans, zebrafish (D. rerio) and human MCUs are aligned and coloured 
according to the ClustalW convention (UniProt accession numbers: 
W2SDE2, U4LFM6, Q54LT0, Q21121, QO8BI9 and Q8NE86, respectively). 


The secondary structure is indicated with ribbons representing a-helices, 
solid lines representing structured loop regions, and dashed lines 
representing disordered regions. The WDXXEP signature sequence is 
highlighted with a red line. 
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Extended Data Fig. 2 | Functional analysis of MCU. a, b, Representative 
Ca?* uptake experiments using digitonin-permeabilized MCU/EMRE 
knockout cells without transfection (red line), or expressing zebrafish 
MCU and EMRE (black line), or expressing zebrafish MCUanrp and 
EMRE (grey line). Blue arrows indicate additions of 5 tM CaCl. A 
decrease in fluorescence following addition of Ca** is indicative of Ca?* 
uptake (for example, black and grey traces). Ruthenium red (RuRed), a 
proton ionophore (CCCP), or a Ca”* ionophore (ETH129) were added as 
controls towards the end of each experiment as indicated. RuRed prevents 
Ca** uptake, ETH129 demonstrates that the mitochondria are intact, 
and CCCP demonstrates that Ca** uptake is dependent upon the proton 
gradient. Analogous distinct experiments were repeated a total of six 
times and yielded similar results. c, Rho-1D4 western blot analysis of the 
cell lysates from a, demonstrating expression of MCU and EMRE, which 
were C-terminally tagged with a 1D4 peptide (see Methods). Repeated 
twice with similar results. d, f, Representative mitochondrial Ca2t uptake 
experiments using MCU/EMRE knockout cells for the indicated mutations 
of human MCU when co-transfected with EMRE (from which the data 
shown in Fig. 4d are derived, see Methods). Independent experiments 
were repeated with similar results: n = 8 for wild type, n=9 for M263A, 
and n=3 for the remainder of the mutants. e, g, Full-size western 

blots of the cell lysates from d and f showing protein expression levels 


(corresponding to Fig. 4d; detected using a Rho-1D4 antibody). Repeated 
twice with similar results. h-j, Subunit stoichiometry analysis of MCU 
proteins using crosslinking. h, Crosslinking of purified CeMCU in the 
detergent n-dodecyl-3-p-maltoside (DDM). Indicated concentrations 

of the crosslinker bis(sulfosuccinimidyl) suberate (BS-3) were incubated 
with purified CeMCU. Analysis is by SDS-PAGE and coomassie stain. 
Molecular weight standards are located in the first lane and their positions 
indicated. The calculated molecular weight of CeMCU is 39.6 kDa based 
upon its amino acid sequence. Repeated three times with similar results. 
i, Crosslinking of human MCU expressed in HEK293 membranes. 
Indicated concentrations of the membrane-permeable crosslinker 
disuccinimidy] glutarate (DSG) were used and human MCU was detected 
by western blot using a C-terminal Rho-1D4 antibody tag (Methods). 
Molecular weight standards are located in the first lane and their 
positions indicated. We note that some oligomerization of human MCU 
was observed without crosslinker; this phenomenon has been observed 
previously for human MCU~. Repeated four times with similar results. 

j, Crosslinking of purified zebrafish MCU in the detergent digitonin. 
Indicated concentrations of BS-3 crosslinker were used. Samples were 
analysed by SDS-PAGE using coomassie stain. Molecular weight standards 
are located in the last lane and their positions are indicated. Asterisks 
indicate protein impurities. Repeated twice with similar results. 
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Extended Data Fig. 3 | Flowchart for cryo-EM data processing of CeMCU. Details can be found in the Methods. 
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Extended Data Fig. 4 | Flowchart for cryo-EM data processing of zebrafish MCU. a, Initial model generation and improvement. b, 3D refinement 
using the improved initial model from a. Details can be found in the Methods. 
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Extended Data Fig. 6 | Cryo-EM structure determination and density 
of CeMCU. a, Orthogonal slices (top) and orthogonal 2D projections 
(bottom) of the final 3D reconstruction (from cisTEM). b, Angular 
orientation distribution of the particles used in final reconstruction. 

The particle distribution is indicated by different colour shades. 

c, Gold-standard FSC curve of the final 3D reconstruction. The resolution 
is 3.2 A at the FSC cutoff of 0.143. A thin vertical line indicates that only 
spatial frequencies to 1/(7 A) were used to determine particle alignment 
parameters during refinement. d, Local resolution of the map estimated 
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using the blocres program and coloured as indicated. e, Model validation. 
Comparison of the FSC curves between model and half map 1 (work), 
model and half map 2 (free), and model and full map are plotted in green, 
red and blue, respectively. f-i, Densities (80 contours) of a-helical regions 
of CeMCU are shown in the context of the atomic model with side chains 
shown as sticks and the backbone as ribbons. j, Stereo representation of the 
density (80 contour) of the NTDs, with the interface between chains A and 
B in the foreground. 


© 2018 Springer Nature Limited. All rights reserved. 


Turret 


TM2 T™1 


IMS view 


c 


ee TMtex| LS 


Coiled coil 


NTD 


Legs 


NTDs 


Extended Data Fig. 7 | Structure of CeMCU. a, Ribbon representation of 
an isolated subunit from Fig. 1c, viewed parallel to the membrane, with the 
structural features labelled. Disordered regions connecting TM2 and a6 
are indicated as dashed lines. b, c, IMS and matrix views of Fig. 1c. 

d, e, Molecular surface coloured according to electrostatic potential 

(red, —8 kT e~!; white, neutral; blue, +8 kT e~'). d, Shown in the same 
orientation as Fig. 1c with approximate membrane boundaries as grey 
bars. e, IMS view. A green sphere indicates the position of Ca** in site 1. 

f, Overall structure of CeMCU, depicted similarly to Fig. 1. g, Subunit 
interactions within the TMD. Ribbon representations of TM1 and TM2 
from one subunit (green) and TM2 of the neighbouring subunit (red) 

are shown. Residues participating in van der Waals or hydrogen bonding 
interactions are shown as sticks; dashed lines indicate hydrogen bonds. 
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Atom colouring: nitrogen, blue; oxygen, red; and sulfur, green. 

h, Interfaces within the NTDs. The NTDs of subunit A, B and C 

(green, red and yellow, respectively) are shown in ribbon and surface 
representations. Each NTD contains six 8-strands (81 through 36) and two 
a-helices (a1 and «2). There are four interfaces between the NTDs, and 
these are of two types. Interface 1 (for example, between protomers A and 
B or Cand D in the atomic model) consists of an interaction between 33 
of one NTD and the 31-2 loop of another; this interface buries 2,188 2 
of molecular surface in the assembled channel. Interface 2 (for example, 
between protomers A and D or B and C) is less extensive, burying 1,440 A 
total surface area, and involves contacts located near the N-terminal ends 
of the al helices from two NTDs. i, j, Details of interfaces 1 and 2 between 
NTDs, respectively. 
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Extended Data Fig. 8 | Comparison with an NMR structure (PDB: 
51D3). Left, various representations of the cryo-EM structure of CEMCU. 
The NTD, TM1ext, TM1, TM2 and coiled-coil regions are shown in 
different colours. Right, structure deduced from NMR studies of C. elegans 
MCU-ANTD (PDB: 5ID3). It is coloured according to the cryo-EM 
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structure using the sequence alignment shown in Extended Data Fig. 1. 
The boxed regions highlight differences in vicinity of the WDXXEP 
signature sequence and pore. Pore-lining residues (D225, E228, T231 and 
Y¥232; C. europaea numbering) are shown as sticks. 
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Extended Data Fig. 9 | Structural comparison of the NTDs of CeMCU, are shown. b, Crystal lattice from an X-ray structure of an isolated NID 


zebrafish MCU and human MCU. a, An NTD from the cryo-EM of human MCU (PDB: 4XTB). Four neighbouring NTDs are coloured 
structure of CeMCU (magenta) is superimposed with the crystal and other NTDs are grey. A similar arrangement is present in PDB 5KUG. 
structure of an isolated NTD from human MCU (grey, PDB: 5KUG, c, Matrix view of the cryo-EM reconstruction of zebrafish MCU (from 


rm.s.d. = 2.0 A). The secondary structure features are indicated; two views _ Fig. 2d). 
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Extended Data Table 1 | Cryo-EM data collection, refinement and validation statistics 


Data collection and processing 
Microscope 
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Magnification 
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Defocus range (um) 
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Software 
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Final particle images (no.) 

Overall map resolution (A) 
FSC threshold 0.143 

Local map resolution range (A) 


Refinement 
Software 
Initial model used (PDB code) 
Model resolution (A) 
FSC threshold 0.5 
Map sharpening B factor (A*) 
Model composition 
Non-hydrogen atoms 
Protein residues 
Ligands 
Lipids 
B factors (A’*) 
Protein 
Ligand 
R.m.s. deviations 
Bond lengths (A) 
Bond angles (°) 
Validation 
MolProbity score 
Clashscore 
Poor rotamers (%) 
Ramachandran plot 
Favored (%) 
Allowed (%) 
Disallowed (%) 


CeMCU 
(EMD-7971) 
(PDB 6DNF) 


FEI Titan Krios (at MSKCC) 
Gatan K2 Summit 
29,000x 


0.8613 (0.4307) 
RELION 2.1, cisTEM 1.0 


C2 
547,637 
376,541 
3.2 


2.8-5.6 
Phenix 1.13 real-space-refine 


N/A 
4.0 


Zebrafish MCU 
(EMDB-7972) 


FEI Titan Krios (at MSKCC) 
Gatan K2 Summit 

37,000x 

300 

100.2 

—1.2~-3.2 

0.692 (0.346) 

RELION 2.1, cryoSPARC, 
cisTEM 1.0, 


196,272 
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The formation of solar-neighbourhood stars in two 
generations separated by 5 billion years 


Masafumi Noguchi!* 


The chemical compositions of stars encode those of the gas from 
which they formed, providing important clues regarding the 
formation histories of galaxies. A powerful diagnostic is the 
abundance of a elements (O, Mg, Si, S, Ca and Ti) relative to 
iron, [«/Fe]. The a elements are synthesized and injected into 
the interstellar medium by type II supernovae, which occur about 
ten million years after their originating stars form; by contrast, 
iron is returned to the interstellar medium by type Ia supernovae, 
which occur after a much longer timescale of roughly one billion 
years!. Periods of rapid star formation therefore tend to produce 
high-[a/Fe] stellar populations (because only type II supernovae 
have time to contribute to interstellar-medium enrichment as the 
stellar population forms), whereas low-[c/Fe] stars require periods 
of star formation that last more than a few billion years (over 
which timescales type Ia supernovae begin to affect the elemental 
composition of the interstellar medium more strongly than type 
II supernovae). The existence of two distinct groups of stars in the 
solar neighbourhood”’’, one with high [c/Fe] and the other with low 
[c/Fe], therefore suggests two different origins, but the mechanism 
by which this bimodal distribution arose remains unknown. Here we 
use a model of disk-galaxy evolution to show that the two episodes 
of star formation® predicted by the ‘cold flow’ theory of galactic gas 
accretion’ also explain the observed chemical bimodality. In this 
scenario, the high-[c/Fe] stars form early, during an initial phase 
of accretion that involves infalling streams of cold primordial gas. 
There is then a hiatus of around two billion years until the shock- 
heated gas in the galactic dark-matter halo has cooled as a result 
of radiation and can itself commence accretion. The low-[a/Fe] 
stars form during this second phase. The peaks in these two star- 
formation episodes are separated by around five billion years. In 
addition, the large-scale variation in the abundance patterns of these 
two stellar populations that has been observed for the Milky Way” 
is partially explained by the spatial variation in this gas-accretion 
history. 

The Milky Way galaxy provides an excellent test bed for models 
of the chemical evolution of galaxies because its individual stars can 
be resolved and examined in detail, unlike most other galaxies. The 
bimodality’~’ that has been observed in the chemical composition 
of stars in the solar neighbourhood is generally assumed to suggest 
(although some scepticism remains'') that star formation in this region 
did not proceed continuously but halted for a certain period, leading 
to two separated episodes of star formation!”3. How star formation 
proceeds is largely governed by how the primordial gas accretes onto 
the forming galaxy. The longstanding paradigm—the shock-heating 
theory—argues that the gas that entered the dark-matter halo is heated 
to a high temperature by a shock wave that provides sufficient pressure 
for the gas to maintain equilibrium within the gravitational field of the 
halo. Then, the gas cools by emitting radiation and accretes gradually 
to the forming galaxy at the centre (hot-mode accretion)“. On the 
basis of detailed numerical simulations, a different hypothesis has been 
put forward that proposes that a substantial part of the primordial gas 
remains cold and reaches the galaxy in narrow streams in an almost 


freefall fashion (the ‘cold flow’ theory)*’®. A hydrodynamical simula- 
tion in growing dark-matter haloes® predicts the presence of a gap in 
gas accretion (figures 2-4 of ref. °) and the resulting bimodal history of 
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Fig. 1 | Chemical evolution and the chemical abundance diagram for 
the solar neighbourhood. a, c, e, Cold-flow model. b, d, f, Shock-heating 
model. All panels refer to the region 7 kpc < R< 9kpc, where R is the 
distance from the Galactic centre. T is the time since the beginning of the 
Universe. The accretion and star formation rates are expressed in masses 
of the Sun per year. The type Ia supernova rate is the number per century. 
The grey shaded region in a and c highlights the gap in gas accretion. Open 
circles in c indicate [a/Fe] at the two star-formation peaks. e, f, Stellar 
distribution in the chemical diagram, with the colour scale indicating the 
relative frequency with respect to the maximum value in each panel. The 
contours are observed distributions based on the APOGEE data‘. The 
purple line indicates the evolution of the gas, with the arrow indicating the 
time sequence. 
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Fig. 2 | Time of entering the dark-matter halo as a function of arrival 
time to the disk. For gas that arrives at the disk at time T, tent denotes the 
time at which it enters the dark-matter halo. Blue and red lines indicate the 
cold-flow and shock-heating models, respectively. The grey shaded region 
corresponds to the gap in gas accretion in Fig. la, c. Myir=1.23 x 10'7Mo 
is the current virial mass used in the models, where Mg is the mass of the 
Sun. 


disk growth (figures 9-11 of ref. 8), providing a viable basis for a more 
detailed understanding of chemical evolution. 

This work, based on these findings, takes this idea one step further 
by incorporating the star-formation process and chemical evolution 
in a simple model of disk-galaxy evolution, with the aim of clarifying 
the origin of the observed chemical bimodality. In the model, a grow- 
ing dark-matter halo gathers the surrounding primordial gas, which 
subsequently accretes to the disk component of the galaxy and forms 
stars!>, We construct two models with different gas-accretion histories, 
corresponding to the shock-heating and cold-flow scenarios. 

Figure 1 depicts the time evolution of the two models and the result- 
ing [a/Fe]-[Fe/H] diagram (referred to as the chemical diagram here- 
after) for the solar neighbourhood. The cold-flow theory predicts a 
clear chemical bimodality, whereas the shock-heating model produces a 
single sequence (Fig. le, f). The locations of the two stellar groups in the 
cold-flow model agree with those observed by APOGEE”. The period 
of dormant accretion about 6 Gyr ago (shaded part in Fig. 1a) is an 
essential event that contributes to creating the well-defined bimodality 
in the cold-flow model. How this gap arises is clarified in Fig. 2. We 
define fe, to be the time when the primordial gas enters the dark-matter 
halo and T as the cosmological time from the beginning of the Universe. 
At tent © 4.3 Gyr, the shock-heating model starts to accrete gas on the 
timescale of cooling, as opposed to that of freefall, and its accretion is 
delayed with respect to the freefall accretion in the cold-flow model. 
In the cold-flow model, the gas that arrives at the disk at T = 8.6 Gyr 
in the shock-heating model is already accreting at T ~ 7.3 Gyr. At 
tent 6.3 Gyr, the cold mode of accretion ends and the gas accretion 
is switched to the hot mode in the cold-flow model. Therefore, the gas 
that enters the halo after fen 6.3 Gyr will not accrete until T ~ 8.6 Gyr. 
This change in the accretion mode gives rise to a period in which accre- 
tion is halted (shaded region in Fig. 2), as documented in the previous 
numerical study®. After tent = 6.3 Gyr, the two models accrete gas on 
the timescale of cooling. Although the detailed times have a significant 
uncertainty of 1-2 Gyr depending on the assumed galactic mass, the 
prediction of the gap in the cold-flow model is robust. 

Figure 1 details the evolution as dictated by the cold-flow model. 
The first phase of gas accretion induces a brief period of active star 
formation with a half peak width of about 2 Gyr, and a elements start 
to be released immediately by type II supernovae. This star formation 
begins to weaken before a large amount of iron starts to be supplied by 
type Ia supernovae. Therefore, most stars formed during this epoch 
have high [a/Fe]. During the subsequent period in which star formation 
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weakens, type Ia supernovae start to dominate. Because the enrichment 
of a elements is overtaken by iron enrichment, [a/Fe] in the interstel- 
lar medium decreases further. When the second phase of gas accre- 
tion starts about 2 Gyr later and star formation is activated again, [c/ 
Fe] stops changing rapidly and decreases very slowly thereafter. The 
second-generation stars have lower [a/Fe] by about 0.2 dex than the 
first-generation stars (two open circles in Fig. 1c). The paucity of stars 
with intermediate [a/Fe] creates a gap between the high- and low-[a/Fe] 
sequences in the chemical diagram. The star-formation history in the 
solar neighbourhood deduced empirically from the observed chemical 
diagram using a galactic chemical evolution code’? reveals a gap about 
7 Gyr ago, in agreement with the prediction of the cold-flow model. 

Recent large-scale surveys such as GAIA-ESO’*, APOGEE” and 
HARPS GTO planet search’® extend the stellar-abundance study to a 
sizable fraction of the Milky Way, reaching more than 10kpc from the 
Sun. The prediction of the cold-flow model can be compared to the 
observations for this large-scale region of the Galactic disk. Figure 3g-i 
shows the distribution of stars in the chemical diagram for three differ- 
ent regions: the inner disk, the solar neighbourhood and the outer disk. 

In view of the idealized nature of the model, here we focus on the 
peak positions and the relative dominance of the two stellar groups. 
The low-[a/Fe] sequence increases its dominance over the high-[a/ 
Fe] one with increasing distance from the Galactic centre, in agreement 
with obseravtions””. This is because the second phase of accretion, 
which increasingly dominates (Fig. 3a—c), involves the gas with higher 
angular momentum, which accretes onto larger radii. This behaviour 
is consistent with the result from a more detailed simulation!’. We also 
note that the [a/Fe] ratio of the low-[a/Fe] sequence increases with 
radius in the model, whereas its [Fe/H] decreases. This is because the 
outer disk is still in a younger phase of chemical evolution at present, 
making stars from interstellar medium that is not yet heavily polluted 
by iron (Fig. 3f). The decrease in [Fe/H] with radius from 0.13 to 0.01 
to —0.26 is in fair agreement with the observed decrease from 0.2 to 
0.05 to —0.3, although the model [a/Fe] varies more strongly than the 
observation. This may be due in part to the idealization of the model, 
which assumes that the stellar yields and star-formation efficiency are 
constant, whereas there is evidence that these quantities can be varia- 
ble”*!, The model is not intended to be complete at this stage. 

The two stellar groups are closely related to different structural com- 
ponents of the Milky Way’. Most high-[c/Fe] stars belong to the thick- 
disk component. The low-[a/Fe] stars tend to have small velocities 
perpendicular to the plane of the disk and hence contribute to the thin 
disk. There is an argument”? that the formation of thick and thin 
disks is not strictly sequential but partly overlapped in time. If this is 
the case, then the simple time sequence of chemical evolution shown in 
the model should be revised. More realistic but expensive simulations 
that treat chemo-dynamical evolution directly” are required to address 
these kinematic issues. 

Recent advances in observational techniques make the galaxies in 
the Local Group and other nearby galaxies feasible targets for studying 
star-formation history by examining the colour-magnitude diagram 
of their member stars”°-?”. Because these spiral galaxies span a wide 
range of masses and the cold-flow theory predicts different evolutions 
depending on galactic mass, this nearby sample of galaxies provides 
powerful diagnostics for the theory. 

M31, the largest galaxy in the Local Group, seems to have experienced 
two episodes of star formation separated by a break about 6 Gyr ago”. 
The cold-flow theory predicts that the quiescent period between the 
two star-formation phases becomes longer for more massive galaxies; 
the break observed in M31 seems to be more prominent than the one 
inferred for the Milky Way”’, in agreement with this prediction. M33 and 
NGC 300, which are several times less massive than the Milky Way, have 
signatures of inside-out formation and their star-formation histories seem 
to be monotonic”®””. According to the cold-flow model, the Milky Way 
has approximately the minimum mass for which a clear star-formation 
gap is expected. Observations of M33 and NGC 300*°”’ appear to support 
this prediction, although the data include a large uncertainty. 
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Fig. 3 | Evolution of the cold-flow model in three different zones. a-i, 

Same as Fig. 1, but for three regions of the disk: 5 kpc < R<7kpc (a, d, g), 
7kpc<R<9kpc (b, e, h; identical to Fig. 1a, c, e) and 11 kpc << R< 13kpc 
(c, f, i). The rates in a—c are normalized so that the maximum value of the 


Detailed simulations suggest that the evolution of disk galaxies is 
a complex process that involves the interplay of multiple pathways”®. 
For example, the formation of the high-[a/Fe] sequence by gas-rich 
mergers at early times is recognized in some cases”*. Radial migration 
of high-[a/Fe] stars may contribute to bimodality!!. The simple model 
used here isolates the effect of cold flows and highlights their role. The 
cold-flow hypothesis can explain the red colour of massive elliptical 
galaxies, in combination with the energy feedback from active galactic 
nuclei”’. It also provides a plausible interpretation for the high inci- 
dence of strongly star-forming galaxies at high redshifts*”. Finally, the 
cold-flow model not only is applicable to a large population of distant 
galaxies but also provides a possible explanation for the diversity in 
nearby individual galaxies, including the Milky Way. 
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METHODS 
The disk-galaxy evolution model incorporating type Ia supernovae. The Milky 


Way model used in this analysis was obtained by using a code that divides a 
disk galaxy into a series of concentric annuli and calculates the time evolution of 
masses of gas and stars in each annulus under the specified gas-accretion history 
and star-formation law’. Following the result of cosmological numerical simu- 
lations*!, the growth of the virial mass of the dark-matter halo with redshift z is 
specified as Myir(z) = Myiroe 7X", where Mying is the present virial mass, K = 3.7 
is a constant* and the halo concentration parameter C at the present epoch is 
given by 


log(C) = 0.971 — 0.094log[M,,,.9/(10'*h_'M,)] 


where h = Hp/(100 km s~! Mpc™!) and Hp is the Hubble constant. The primordial 
gas gathered by the dark-matter halo is assumed to have the same density profile 
as the halo, described by the Navarro-Frenk-White profile. 

The shock-heating model is constructed by assuming that the gas is heated to 
the virial temperature when entering the host halo and accretes to the disk plane 
with the timescale of radiative cooling (or with the freefall time if the cooling 
time is shorter). The cold-flow model is based on the Dekel-Birnboim model”. 
On the basis of stability analysis and aided by numerical simulations, it provides 
information on when a stable shock wave appears in the growing halo and the gas 
accretion changes from cold mode to hot mode. The gas is made to accrete with 
the freefall time in the initial cold-flow phase, but with the longer of the cooling 
time and the freefall time after the shock develops. We used the prediction of the 
original Dekel-Birnboim model” for the cold-flow model presented here instead 
of the extended cold-accretion model". The latter model is a hypothetical model 
devised to match the observed large disk sizes of galaxies much more massive than 
the Milky Way (stellar mass larger than 10''Mo), for which the original Dekel- 
Birnboim model predicts too small radii. On the other hand, the Dekel-Birnboim 
model is based on a detailed stability analysis for virial shock and is confirmed by 
numerical simulation for collapsing gas spheres with radiative cooling. The Milky 
Way and other nearby spiral galaxies lie in the mass range for which the Dekel- 
Birnboim model is applicable. 

The original code considers the chemical enrichment of the interstellar gas 
attributed only to type II supernovae. Here, chemical enrichment due to type Ia 
supernovae is added. Specifically, « elements of 3.81Mo and iron of 0.094Mo5 
are added to the cold gas for one type II supernova. One type Ia supernova 
is assumed to return 1.65Mo of material to the cold gas, of which a 
elements and iron make up 0.438Mo and 0.88Mo, respectively. The yields for 
a elements by type II supernovae and for iron by type Ia supernovae adopted 
here are both increased by 20% from the values listed in the literature** to obtain 
the best match to the observation. Type II supernovae are assumed to explode 
instantaneously after the birth of progenitor stars. Type Ia supernovae take 
a certain period after stellar birth before they explode. An observationally 
estimated delayed time distribution (DTD)** is introduced so that the rate for 
type Ia supernovae (SN) is 


LETTER 


-11 
DID()=1x 10° | : 

M,Gyr | Gyr 

where t denotes the time since the progenitor formed. Considering the uncertainty 
in binary stellar evolution*’, we assume that DID =0 for t< 0.3 Gyr. The type II 
supernova feedback efficiency is taken to be ¢ = 0.15 instead of the original value 
0.05, to match the model with the Milky Way; this change does not influence the 
main conclusion of this work. All other model parameters are the same as those 
adopted previously’. 

Adopting the current virial mass of the dark-matter halo* of Myiro=1.2 x 10’7Mo 
(with a corresponding collapse redshift of 2K/C = 0.78), the cold-flow model pro- 
duces a stellar disk of mass 6 x 10'°Mo with a scale length of 3.0kpc in the present 
epoch, in broad agreement with the estimate for the Milky Way*”*®. The inner disk, 
solar neighbourhood and outer disk analysed here are defined as an annulus in the 
disk with 5kpc< R<7kpc, 7kpc< R<9kpc and 11kpc<R< 13kpc, respectively, 
where R is the galactocentric radius. Each region contains 15, 11 and 8 radial bins, 
leading to the spread of the stellar distribution shown in Figs. 1 and 3. The abundance 
ratio is the value relative to the solar value on a logarithmic scale. Solar abundances 
of [o/H] =2.22 x 10~? and [Fe/H] = 1.77 x 107? are assumed”. The contours in 
Figs. 1 and 3 are taken from ref. 4; credit: M. Haywood et al. A&A, 589, 66 (2016), 
reproduced with permission © ESO. 

Data availability. The data that support the findings of this study are available 
from the corresponding author on reasonable request. 

Code availability. We have opted not to make available the code used to calculate 
the evolution of the disk-galaxy models because it is a part of an integrated program 
currently in use for other projects. 
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Spontaneous emission of matter waves from a 
tunable open quantum system 


Ludwig Krinner!, Michael Stewart!, Arturo Pazmifio!, Joonhyuk Kwon! & Dominik Schneble!* 


The decay of an excited atom undergoing spontaneous photon 
emission into the fluctuating quantum-electrodynamic vacuum 
is an emblematic example of the dynamics of an open quantum 
system. Recent experiments have demonstrated that the gapped 
photon dispersion in periodic structures, which prevents photons 
in certain frequency ranges from propagating, can give rise to 
unusual spontaneous-decay behaviour, including the formation of 
dissipative bound states!~>. So far, these effects have been restricted 
to the optical domain. Here we demonstrate similar behaviour in 
a system of artificial emitters, realized using ultracold atoms in an 
optical lattice, which decay by emitting matter-wave, rather than 
optical, radiation into free space. By controlling vacuum coupling 
and the excitation energy, we directly observe exponential and 
partly reversible non-Markovian dynamics and detect a tunable 
bound state that contains evanescent matter waves. Our system 
provides a flexible platform for simulating open-system quantum 
electrodynamics and for studying dissipative many-body physics 
with ultracold atoms*®. 

The Weisskopf—Wigner model of spontaneous emission”*®, a cen- 
tral concept in quantum optics’, describes how an excited atom can 
decay to its ground state owing to coupling to zero-point oscillations 
of the electromagnetic vacuum. It simultaneously represents one of 
the first open quantum systems discussed in the literature and an area 
of research that has recently seen a resurgence of intense theoretical 
efforts!°-5, In its usual Markovian formulation, the model makes the 
assumption that the decay proceeds on a much slower timescale than 
the optical period, which leads to a memoryless, exponential decay 
of the excited-state amplitude and to an associated Lamb shift of the 
transition frequency. For free-space emission, the Markovian approx- 
imation is generally fulfilled to high accuracy. 

On the other hand, modifications to the mode density of the vacuum 
can change the features of spontaneous decay. This was recognized first 
in the 1940s!* and again decades later', during the development of 
cavity quantum electrodynamics'®', where the decay can be altered 
to the extreme point of coherent vacuum Rabi oscillations when the 
spectrum is restricted to a single mode. Between these two limits lies 
the regime of a vacuum with a bounded continuous spectrum, in 
which a strong modification of spontaneous decay behaviour occurs 
close to the boundary. An example is photonic crystals (also called 
photonic-bandgap materials)!*”°, where a periodic spatial modula- 
tion of the refractive index gives rise to a gapped dispersion relation. 
For emission close to a bandgap, the Markovian approximation can 
no longer be applied, and novel features appear, including oscillatory 
decay dynamics for energies above the band edge and the formation 
of atom-photon bound states below it?!, Over the past two decades, 
experiments on spontaneous emission in photonic-bandgap materi- 
als, including the microwave domain, have observed some of these 
effects, specifically modified spontaneous emission rates””*? and Lamb 
shifts”, as well as spectral signatures for non-exponential decay’. Very 
recently, experiments have probed the long-predicted atom-photon 
bound state”>”®, using both transmon qubits coupled to corrugated 
microwave guides? and atoms in photonic-crystal waveguides’, 


with the prospect of engineering systems with optical long-range 
interactions”. 

Here, we realize an atom-optical analogue*® of emission in a 
one-dimensional photonic-bandgap material, where the singularity 
in the mode density near the edge of the continuum leads to particu- 
larly strong deviations from Markovian behaviour. In our system of 
matter-wave emitters, the free tunability of the excitation energy and 
decay strength allows for a systematic exploration of the emergence 
of non-Markovian dynamics, including partial reversibility and the 
formation of a matter-wave bound state that can be directly detected. 
Importantly, the close spacing of emitters gives rise to collectively 
enhanced dynamics beyond the Weisskopf-Wigner model. 

The experimental configuration is shown in Fig. la. Using a deep 
three-dimensional optical lattice with state selectivity along one axis, 
we prepare a sparse array of atoms confined to sites that are embedded 
in a system of isolated tubes acting as one-dimensional waveguides 
(see Methods for details). An atom’s internal state (|r), red) is coher- 
ently coupled to a second, unconfined internal state (|b), blue) using an 
oscillatory magnetic field. Each site thus acts as a two-level matter-wave 
emitter, with harmonic-oscillator ground-state occupational levels |g) 
(empty) and |e) (full), supporting both the emission (for |e) — |g)) and 
the absorption (for |g) — |e)) ofa |b) atom. The excitation energy of 
the emitter, which is given by the detuning, A, of the coherent coupling 
from the atomic resonance, is converted into kinetic energy for atomic 
motion along the axis of the waveguide. 

One of the main features of each matter-wave emitter is its ability to 
undergo spontaneous decay, as described by the Weisskopf-Wigner 
model. Assuming no lattice potential, the driven atom performs simple 
Rabi oscillations between two internal states |r) and |b). These oscilla- 
tions are described by the Hamiltonian H=(hQ / 2)e"rh +h.c., where 
§2 denotes the strength and 6 the detuning of the coupling from the bare 
atomic resonance, hf is the Planck constant and ‘h.c? denotes the 
Hermitian conjugate. The tight confinement of just one of the states 
(here, |r)) strongly couples the atom’s internal and motional degrees of 
freedom, producing a zero-point energy shift of 2) =hw,/2>> hQ, 
where Wy is the harmonic-oscillator frequency in the potential, as well 
as a kinetic-energy shift of ¢, = h’k*/(2m) for the motion of the free 
|b) state at hk momentum. As a consequence, the detuning and 
strength of the coupling are shifted to A, = 6+ (E)—&)/A and Q= 
274, respectively, with += (k|w.) denoting the overlap of the 
external wavefunctions. Integration over all possible momenta k then 
yields® H = hg e'!|g)(elb,+ h.c., with g, = {2;/2; that is, the stand- 
ard Weisskopf-Wigner Hamiltonian describing spontaneous emission 
into a vacuum of modes (k, ¢;) (see Fig. 1b). In contrast to optical 
emission in free space, the dispersion relation ¢; is quadratic, as in a 
photonic crystal (see Fig. 1c). In such crystals, the emission energy 
relative to the edge of the continuum may be adjusted through the 
crystal’s band structure; in our system, the excitation energy, 
hA = hA,;~», itself is tunable, including the case A <0. Importantly, 
the tunability also includes the vacuum coupling, gx, which is set by (2. 

A common scenario considered in the Wigner-Weisskopf model is 
emission deep into the continuum, such that the decay dynamics is 
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Fig. 1 | Realization of matter-wave emitters. a, Experimental 
configuration. An occupied site of an optical lattice embedded in 

a single-mode matter waveguide acts as an elementary emitter of a 

single atom; adjacent empty lattice sites act as absorbers. The bottom 
illustration shows the momentum distribution in the waveguide after 
release and free expansion, where hk, = h(27/.,) (with a wavelength of 
Az = 790.1 nm) is the recoil momentum. b, Emission mechanism: (i) bare 
internal-state pair in 8’Rb (hyperfine ground states |r) = |F=1, mp=—1) 
and |b) =|F=2, mp=0) split by 6.8 GHz); (ii) state pair in a frame co- 
rotating with a near-resonant microwave field with variable detuning 

6 and coupling strength 2; and (iii) state pair in the co-rotating frame 
after applying the state-selective lattice potential (detuning shifted to 
A=6 + wo/2, where wo =h x 40 kHz). The microwave couples (with 

$2 = (27) the trapped |r) state to free |b,) states with momentum k and 
kinetic energy ¢,. ¢, The filled (empty) potential well can be viewed as the 
excited (ground) state |e) (|g)) of a matter-wave emitter. The emission of 
atoms in this scenario is similar to the emission of photons in photonic- 
bandgap (PBG) materials, with both featuring quadratic dispersions and 
energetically forbidden regions. 


much slower than the timescale set by the excited-state energy (or the 
elevation above the band edge in the case of a photonic crystal). This 
allows for a Markovian treatment and results in exponential decay of 
the excited state. Following Fermi’s golden rule, the decay width, J; is 
the product of the mode density, p, and the square of a matrix element, 
Hye, which for optical decay is the product of the electric dipole moment 
and the zero-point field of the resonant mode. For our system, an 
analogous analysis® (valid for (2/A < 1) leads to P= 2 / WA, con- 


taining the one-dimensional mode density p « 1/ ./ A and H, ee KAD; 
where k = ,/2mA/h represents the resonant mode. 

Because of the residual axial tube confinement, w,, all measurements 
are taken for (2/w,> 1 and associated timescales shorter than 
T;=25/w, 10ms. The measured |r) population is shown in Fig. 2 for 
parameters in the (quasi-)Markovian regime (.2/A)* < 1 asa function 
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Fig. 2 | Markovian regime. a, Time evolution of the lattice population for 
Q=2n x 0.74(5) kHz and A=2r x 1.9(3) kHz (red symbols). Each point 
is the average of at least three measurements and error bars show the 
standard error of the mean (s.e.m.). The red line is a phenomenological 
exponential decay curve with a fitted rate of 2a x 94(3) Hz and an offset of 
0.503(4). The light-grey lines represent the Markovian approximation 
(dashed, = 2n x 72(12) Hz) and the exact analytical solution for an 
isolated emitter® (solid). b, Lattice population as a function of A for 
t=0.4 ms and (2=2r x 1.5(1) kHz. The solid line is the Markovian 
expectation with the overall decay width, I, scaled by 0.61(1). c, Detected 
momentum distribution of |b) atoms versus A for parameters as in b. The 
dashed line is the single-particle dispersion; data for small positive and for 
negative detunings are outside the Markovian regime. d, Raw time-of- 
flight data for extracting the energy shift at A= 27 x 6.0(3) kHz. Colour 
scale as inc. e, Measured shifts 6, = Ag—A in the regime 2/A <1 for 

2t = 1.24 and averaged over A=2r x {1, 2, 4, 6} kHz. The data are 
extracted from the second moment (maximum) of the momentum 
distribution, shown by blue squares (red circles). Error bars show the 
s.e.m. The blue solid and red dashed lines are quadratic fits and the grey 
dotted line represents 6,({2). 


of time (Fig. 2a) and detuning (Fig. 2b); the data in Fig. 2a are for 
QA 0A at AX2n7 x 2 kHz (with [=27 x 72 Hz). After a variable 
coupling time ¢, we observe an irreversible, exponential decay in agree- 
ment with the expectation; however, the measured population does not 
decay to zero but instead saturates at a finite value. We qualitatively explain 
this discrepancy by taking into account that an excited emitter is not iso- 
lated but part ofa (mostly) ground-state array that enables reabsorption, 
in analogy to an optically thick medium. In the Weisskopf—Wigner for- 
malism, the array is modelled by introducing site-dependent phases 
and projectors, resulting in* H = De ig, ef Ant“) Ig.)e\| by +huc. 
Following a master-equation-based treatment”, we expect excitations to 
be transferred between neighbours on a timescale of t + 1/['=2ms, 
leading to a slowdown of the decay, in qualitative agreement with 
Fig. 2a (see also Methods section ‘Numerical simulation of array 
effects’). Moreover, the emitted atoms cannot escape from our system 
for long times, which induces the formation of a steady state as t 
approaches 7,. Here, additional dephasing effects that are not consid- 
ered in this model may arise from collisions between emitted atoms 
(scattering between modes in each tube). At early times, t < 1/I; with 
still weak reabsorption, the decay at a fixed time T (see Fig. 2b) displays 
approximately the expected detuning dependence of the Weisskopf- 
Wigner model, exp[—I(A)r] = 1 — I{A)z (albeit with a downward 
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Fig. 3 | Non-Markovian dynamics and bound-state formation. 

a, Time evolution of the |r) population for A = —2n x 0.1(3) kHz and 
Q=2n x 3.0(3) kHz (with points and error bars as in Fig. 2a). The grey 
line is our analytical model. b, Same as a, but for A = —27 x 1.7(3) kHz. 
The fitting parameters of the analytical model are A = —2n x 2.08(3) kHz 
and 2= 27 x 2.79(4) kHz. ¢, Illustration of a stationary bound state 

for negative excitation energies. d, Asymptotic fraction of |b) atoms, 
extracted from the time evolution after t= 2.6 ms, for as ina and 
A=-—2n xX 2.2(3) kHz. The top (bottom) histogram refers to the case 

of a sudden (adiabatic) turn-on of the coupling, measured from 50 
experimental runs each. e, Illustration of separation of evanescent and 
propagating waves along the weakly confining tube axis. The atomic 
sample resides off-centre from the minimum of the axial tube potential 
(solid line), shifted from that of the initial trap (dashed line), which 
defines the sample position. While the sample and bound state are frozen 


rescaling of the actual value of Mat the finite pulse time 7=0.4 ms used, 
with reabsorption effects already apparent). 

We next characterize the momentum distribution of the emitted atoms. 
For this purpose we apply a 0.4-ms-long coupling pulse and then observe 
the location of the |b) atoms after 15 ms of free fall, using state-selective 
absorptive imaging (see Methods). On the basis of the Markovian approx- 
imation, isotropic emission with wavepackets centred near the resonant 
momentum k(A) is expected. In Fig. 2c we show the observed momen- 
tum distribution as a function of A; the emission clearly exhibits parabolic 
dispersion. Moreover, the spectral width, o;, of the separated wavepackets 
decreases with detuning, in qualitative agreement with the expectation 
(0% 1/A for large detunings®; a quantitative comparison is compromised 
by the finite time of flight). The ‘intensity’ of the emitted matter-wave 
pulse strongly depends on the detuning, as shown in Fig. 2b. 

The standard Markovian treatment of the Weisskopf-Wigner model 
yields a Lamb shift of the ground and excited states as a unitary cou- 
pling to the vacuum. An analogous analysis for our system*® yields a 
shift 6, = 27/wy of the excited-state energy, to (A — 6,). We measure 
the momentum distribution for variable 2 at several values of A and 
then calculate the mean kinetic energy of the wavepackets both from 
the second moment of the momentum distributions and from the loca- 
tion of their fitted maxima (see Methods for details). To facilitate com- 
parison with the model, the data are taken for a constant effective pulse 
area 2t, where 2 = (2/ Wwp)!/ °Q, and t < 1 ms to mitigate propagation 
effects. The results for the (quasi-)Markovian regime (2/A)? < 1 are 
shown in Fig. 2d, e as a function of 2. The extracted shift has the sign 
and approximate quadratic dependence of 6, but is a factor of roughly 
three larger. We caution that while this alone could point towards the 
existence of collective enhancement, there is no indication for super- 
radiance*”® from the decay data (which is consistent with the fact that 
there is no overall population inversion in the array). 

Our system readily allows the study of spontaneous emission outside 
the Markovian regime, as shown in Fig. 2c. In particular, the diverging 
one-dimensional mode density near ¢,=0 greatly enhances the effects 
of the edge of the continuum. For emission at low excitation energy, 


Transferred population 


Momentum, k/k, 


in place (vertical dotted line), the nonadiabatically released fraction (blue 
arrow) is accelerated towards the potential minimum. f, Momentum 
distributions of |b) atoms for the two scenarios considered in d. Open 
(filled) circles represent the sudden (adiabatic) turn-on of the coupling, 
averaged over all 50 runs, and triangles show the difference of the two 
datasets. The solid line represents the square of the Fourier transform 

of the analytical evanescent wavefunction’, fitted to the adiabatic data 
(Agt = 27 x 2.1(1) kHz). The centre of mass of the non-adiabatic dataset is 
shifted by p, = 0.32(1)hk, relative to the adiabatic dataset, and that of their 
difference (with a half-width at half-maximum of 0.57(1)hk,) is shifted by 
ps=0.80(1)hk,. The expectation for free atoms is ps = 0.83(7)hk,. Lower 
inset, corresponding real-space evanescent wavefunction (blue, solid) 

and Wannier function of a lattice-trapped |r) atom (dashed, red). Upper 
inset, raw data for the momentum distributions of e, before subtraction of 
spurious higher-band contributions (see Methods). 


A/2 <« 1, we expect dynamics reminiscent of a two-level system, with 
damping provided by low-energy modes. Results of measurements at 
A=0 are shown in Fig. 3a. We observe oscillations similar to the pre- 
dictions of our isolated-emitter model® (which now features a finite 
offset, in contrast to positive detunings) but with higher frequency and 
less damping, suggesting that the dynamics is coherently enhanced by 
low-energy modes whose wavelengths can extend over several emitters 
(see also Methods section ‘Numerical simulation of array effects’). 
For emission below the continuum edge (that is, for A <0), we expect 
the formation of a stationary bound state*®, as illustrated in Fig. 3c. For 
our one-dimensional system, this state consists of a partly excited emit- 
ter dressed by an evanescent, approximately exponentially decaying 
matter wave, with a binding energy of hwg + hA and a localization 
length® of €= 1/./2m |w,| /h. To isolate the bound state, the coupling 
needs to be turned on slowly to prevent the additional population of 
freely propagating modes® representing a non-adiabatic, transient shed- 
ding of matter waves. However, first we proceed as before by switching 
on the coupling, at A =—27 x 1.7kHz. The lattice population, shown 
in Fig. 3b, shows a transient oscillation (with much lower amplitude 
than at the edge) settling to an asymptotic value below unity (with an 
observable |b) population; see Fig. 2c). Remarkably, our isolated-emitter 
model® now closely fits the data within the experimental uncertainties. 
Indeed, for the chosen parameters, € is less than half a lattice period, 
which should lead to a relative suppression of long-range couplings. 
To access the properties of the bound state, we first determine the frac- 
tion of |b) atoms by comparing the asymptotic lattice population for a 
sudden and for an exponential turn-on of the coupling. The results in 
Fig. 3d show that 7.1(2)% of the population are in the evanescent wave, 
with a total |b) population of 12.7(2)% (here and elsewhere the errors 
quoted denote one standard deviation). The observed fraction of |b) 
atoms in the bound state (55%) is close to the expectation® of 47% for 
the chosen parameters, with the excess possibly stemming from residual 
non-adiabaticity of the ramping (we noticed an inconsistency in ref. ° for 
the total |b) fraction, but this does not affect the relative |b) fraction in 
the bound state). Importantly, the recorded momentum distribution of 
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the bound evanescent fraction of |b) atoms (see Fig. 3f) can be directly 
compared to the prediction of our analytical model*® owing to the absence 
of propagation effects. Fitting the model to the data for the adiabatic 
ramp-up with A as the only fitting parameter reproduces its experimental 
value to within the experimental uncertainty, and the agreement between 
the model and the measured momentum distribution is indeed excellent. 
The localization length obtained from the exact model is €= 142(3) nm 
(we note that for the parameters chosen’, wg ~ 1.3A). 

Finally, additional direct evidence for the non-adiabatically released 
part can be obtained by comparing the adiabatic momentum distribu- 
tion to that recorded after the rectangular pulse. Because our sample is 
prepared off-centre from the potential minimum along the tube direction 
(see Fig. 3e, Methods), the distribution of the shed |b) atoms can separate 
in momentum space from the bound fraction. The pulse duration used in 
this experiment corresponds to about a quarter of the oscillation period 
(2.6 ms) along the tube axis, thus maximizing this differential effect. As 
seen in Fig. 3f, the (symmetric) difference of the two momentum distri- 
butions is centred at the finite momentum expected for atoms released 
with zero velocity at the beginning of the pulse. This finding is consistent 
with the time evolution in Fig. 3b, which suggests that the release of the 
unbound fraction occurs within a short time (about 0.5 ms) after the 
coupling is turned on. We note that as a result of propagation in the tubes 
for a quarter of the oscillation period, the width of the momentum distri- 
bution of the released atoms reflects that of their distribution in real space 
(v,= Zw). The extracted half-width at half-maximum of 5.47(9) um is 
comparable to the Thomas—Fermi radius of the initial condensate. 

Much of the present work has focused on basic properties arising from 
the tunability of our Wigner-Weisskopf system, including the formation 
of bound states below the edge of the mode continuum. On the single- 
emitter level, this provides a direct analogy to atomic decay near the 
bandgap of a photonic crystal. We note that in yet another context, the 
observed non- Markovian oscillatory dynamics also reproduces predic- 
tions for electron photodetachment from negative ions’””. The optical 
lattice geometry opens up various additional avenues of inquiry. For 
emission sufficiently above the continuum edge, these may include novel 
types of superradiance that depend on the degree of coherence of the lat- 
tice population (superfluid or Mott insulating)*° and have no analogue 
in optical systems. Moreover, controlling the longitudinal waveguide 
level spacing should enable studies of the transition between Dicke- 
and Tavis-Cummings-type models in quantum optics*? (restricted to 
co-rotating terms), including their modification in the non-Markovian 
regime. Unlike photons, the emitted atoms can directly interact with 
each other, which should give rise to additional, nonlinear effects that 
modify the population dynamics. For negative energies, the bound state 
lends itself to the realization of lattice models* with modified tunnelling 
and interactions. Superficially, the structure of the bound state resem- 
bles that of a lattice polaron*! (for which a phononic Lamb shift has 
recently been measured**), with massive vacuum excitations replacing 
massless Bogoliubov sound excitations. Rather than reducing transport, 
the bound state here leads to an enhancement of mobility. The presence 
of tunnelling with a tunable range is of interest, for example, for studies 
of integrability and thermalization in one-dimensional geometries. 


Online content 

Any Methods, including any statements of data availability and Nature Research 
reporting summaries, along with any additional references and Source Data files, 
are available in the online version of the paper at https://doi.org/10.1038/s41586- 
018-0348-z. 
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METHODS 


Experimental procedures. In this section we outline experimental procedures 
implemented in our measurements. 

Sample preparation. In the experiment we used ®’Rb atoms in the hyperfine ground 
states |r) =|F=1, mp=—1) and |b) =[2, 0) (the fact that |r) lies below |b) is incon- 
sequential in the rotating frame). The atoms are confined to a two-dimensional 
array of roughly 10° isolated lattice tubes spaced at 532 nm, each with a radial con- 
finement of w, =27 x 26kHzand a residual axial confinement of w,=27 x 97 Hz 
that quantizes the mode spectrum for released |b) atoms in the z direction; this 
confinement is inconsequential for times much shorter than 7,=27/w, + 10 ms. A 
state-selective lattice with period 4,/2 = 395 nm and harmonic-oscillator frequency 
wo =2n x 40(1) kHz strongly confines the |r) atoms along the tube axis. 

Starting with an optically trapped Bose-Einstein condensate™, we first create an 
atomic Mott insulator of |r) atoms by simultaneously ramping up all three optical 
lattices over a time of 90 ms to depths of 40E,1 o64nms40Es1,064nm and 30E,,790.1nm> 
where E,,,= (h/A)?/(2m) is the respective recoil energy, in the x, y and z directions, 
respectively. This procedure results in a deeply confined Mott-insulating sample, 
while also shifting the minimum of the tube potentials by about 10 1m with respect 
to the sample. The shift (which is comparable to the sample size and corresponds 
to the change of the gravitational sag during the ramp™ after the atoms are effec- 
tively pinned by the z lattice) leaves the resonance condition for individual atoms 
unaffected because |r) and |b) experience the same tube potential. 

After loading the atoms into the lattice, we transfer a fraction of approximately 

0.82 to an intermediate state |2, 1) using a two-photon radiofrequency pulse 
(microwave radiation of 6.8 GHz and radiofrequency radiation of about 3 MHz) 
of 0.95 ms duration. These atoms are eventually removed using resonant light on 
the D, cycling transition (F = 2—+F’ = 3). Using the |2, 1) state has the advantage 
that there is no first-order shift in the energy difference between |1, —1) and |2, 1) 
due to either magnetic field or the state-selective optical potential. Also, collisional 
shifts*> of doubly (triply) occupied sites are roughly 50 Hz (100 Hz) for our lattice 
parameters, much less than the spectral width of the pulse. This yields a sample 
of 2.8(2) x 10* |r) atoms with an average site occupation of (m;) < 0.5 in the tubes. 
Having thus created an initial state of matter-wave emitters, we then switch on a 
6.8-GHz microwave field of variable coupling strength 2 and detuning A. 
Atom detection. We switch off the microwave coupling (2 suddenly, and we measure 
the population remaining in the lattice and access the momentum distribution of 
the released atoms with state-selective absorptive imaging, using a combination 
of band mapping and Stern—Gerlach separation during the time of flight (TOF). 

The state detection begins with a 500-1s bandmap step during which all lat- 
tice potentials are ramped down to zero (the bandmap step prevents a rapid 
transverse expansion out of the tube potentials and is therefore favourable for 
momentum-space detection when small atom numbers are used). Subsequently 
all remaining trapping potentials are turned off suddenly, and after an expansion 
time of about 1 ms a pulse sequence is applied for magnetic-field characterization 
(see below), immediately followed by a 5-ms-long magnetic field gradient pulse 
to separate states of different magnetic moments. Finally, 14.5 ms after the release 
from the optical potential, a 200-|1s-long imaging pulse of resonant light on the 
F=2— F'=3 D, cycling transition is used to detect the F=2 atoms; this yields a 
total effective TOF of 14.6 ms for atoms in this state. The F= 1 atoms are repumped 
for 100 js using resonant D, light (F = 1 — F’ = 2) after an additional 2.7 ms, 
immediately followed by another 200-1s-long imaging pulse (D2, F=2 — F’ =3) 
to detect the repumped F= 1 atoms after a total TOF of 17.6 ms. Because all mag- 
netic moments are unique within F= 1 and F=2 separately, we fully resolve the 
population in each individual hyperfine state |F, mp). We note that technically the 
cloud centres of F= 1 and F=2 along the imaging direction are still overlapped 
during the second imaging pulse, however the F=2 atoms are pushed out of the 
field of view by the time the F= 1 detection occurs. 

Image analysis. Our raw data are images acquired using a Princeton Instruments 
PIXIS:1024B charge-coupled device (CCD) camera with WinView32 software in 
.spe format. We use Mathematica to read the raw image data and create standard 
absorption images. Residual fringes are removed by subtracting typical fringes 
found using principal-component analysis of a large set (>100) of empty absorp- 
tion images. 

Momentum-space calibration. Our standard momentum calibration relies on 
Kapitza—Dirac diffraction®* from the z lattice. For a more precise determination 
of emission momenta, we take into account residual propagation in the tubes that 
slows the atomic motion. After ramping up the z lattice, the tubes are created by 
partial retro-reflection of the Gaussian beams of our optical trap (1/e? radius of 
w= 135 |tm)*4, which leads to an increase of the optical confinement w,/(27) from 
72(1) Hz to 97(1) Hz (with the gravity direction along z). The tubes are again 
ramped down within 500 1s after the microwave pulse (together with the z lattice, 
for band-mapping purposes), followed by a switch-off of the optical trap. We 
numerically simulate the motion of atoms in the tubes by assuming that the release 
(with momentum +4) occurs midway through the pulse, at the centre of the 72-Hz 
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trap, and by calculating the trajectories in the time-dependent optical potential 
until detection, after a TOF of 14.6 ms. We see that the calibration differs from the 
Kapitza-Dirac diffraction results by —0.5%, —1% and —6% for pulse durations of 
0.2 ms, 0.4ms and 1 ms, respectively, with negligible differences for shorter pulses. 
These corrections are included in Fig. 2. 

Characterization of magnetic fields and optical potentials. The quoted uncer- 
tainty of 300 Hz in the detuning A has contributions from both differential Zeeman 
and alternating-current Stark shifts, which are characterized as follows. 
Magnetic fields. All experiments are carried out at fields between 4.9998(1) G 
and 5.0002(1) G, where the error corresponds to an uncertainty in the bare-level 
splitting of 70 Hz between |r) and |b). The magnetic field for each iteration of the 
experiment is monitored using a series of Rabi pulses during the TOF*’, in which 
atoms in |1, —1) are redistributed to the |2, —2) and |2, —1) states. This method 
allows field reconstruction to within 100 \1G (for details see ref. °”). The inhomo- 
geneity of the magnetic field across the sample is characterized using a Ramsey 
pulse sequence. This inhomogeneity does not exceed 70 \1G, which corresponds 
to a root-mean-square (r.m.s.) variation of A of 15 Hz across the sample for the 
(|r), |b)) state pair. 

Differential optical potential. The state-selective optical potential is created using 
ao -polarized laser beam (waist, 230 jum) at \,=790.10(2) nm. The polarization 
and wavelength are set (using a \/4 waveplate and laser controls) such that the 
lattice potential seen by the |r) atoms is maximized, while the |b) atoms experience 
zero potential. The change in lattice potential with wavelength is 0.12E,/(0.01 nm), 
whereas the maximum theoretically possible polarization change (a~ to 7) changes 
the potential by 0.01£, (we note that the polarization is stable in the experiment). 
We characterize the state selectivity using a sequence of ten Kapitza-Dirac pulses 
(10 \1s) spaced at the Talbot resonance time*’, 7 = (4E,,790,1nm/h) | = 68 \1s, to 
ensure a suppression of the optical potential for |b) atoms by a factor of more 
than 100 with respect to that for |r) atoms, consistent also with the remaining 
wavelength uncertainty of 0.02 nm. 

A crucial part of the experiment is the reliable determination of the resonance 
condition, or the value of A. Because the creation of our matter-wave emitter 
(Fig. 1c) starts with a detuned Rabi oscillation of two hyperfine states, we first 
(post-)stabilize the magnetic field as discussed above and in ref. °”. The resonance 
condition (‘excited-state energy’) of the matter-wave emitters is determined by the 
detuning of the hyperfine spin and the zero point shifts to both states induced by 
the state-selective potential. Because we can currently control the wavelength only 
to an accuracy of 0.02 nm r.m.s., the overall uncertainty is limited to a maximum 
of +340 Hz r.m.s., with the most extreme fluctuations limited to 1 kHz. To address 
and monitor this issue, we bracket each measurement (such as measuring the time 
evolution at a given detuning) by a resonance curve; an average of such resonance 
curves is shown in Extended Data Fig. 1 (see also figure 5 in ref. *”). The resonance 
curve is taken without any transverse lattices on, with a low atom number in the 
optical trap (around 30,000 atoms) and only at partial transfer (at most 30%) to 
minimize systematic mean-field and density shifts. The residual systematic shift 
of the resonance condition due to mean-field and density effects is estimated to 
be less than 100 Hz by using direct simulation of resonance curves with the one- 
dimensional time-dependent Gross-Pitaevskii equation. The scatter of the centre 
of the resonance curves during a typical measurement has an r.m.s. value of 
0.3 kHz, which is the quoted uncertainty of the detuning and also matches the 
expected wavelength reproducibility of 0.02 nm. 

To characterize inhomogeneities of the state-selective optical potential, first we 

precisely calibrate the Rabi coupling strength (for a range of coupling strengths) in 
the absence of state-selective potentials. We then compare the maximum observed 
population transfer in a Rabi spectrum to the expected maximum population 
transfer into the lattice. Based on the comparison, we estimate an upper bound 
for the inhomogeneity of 300 Hz r.m.s. We note that the inhomogeneities of the 
trapping potential at 1,064 nm do not exceed 20 Hz r.m.s. across the sample. For 
comparison, the absolute magnitude of the spectroscopic shift of |1, —1) < |2, 0) 
in the centre of the optical trap does not exceed 400 Hz. 
Background subtraction. The sequence used to thin out the atomic sample leaves 
roughly 10° atoms in the |b) state before the microwave pulse is applied. This results 
in a diffuse background in the momentum distributions, as illustrated in Extended 
Data Fig. 2. We remove this background by subtracting reference data taken for 
zero pulse time. The result is shown in Fig. 2c. 

In Fig. 3f we also show the subtraction of background due to the very small 
fraction of higher-band population. The first excited band is expected at approx- 
imately 2x x 36(1) kHz or +3hk,. For the subtraction, we first fit three Gaussian 
peaks to the adiabatic ramp and one soft-edge box and two Gaussians to the sudden 
turn-on data, requiring identical atom numbers in the soft-edge box and the small 
Gaussian. The fitted functions are subtracted from the data, which then yield the 
main plot in Fig. 3f. The number of atoms emitted from higher bands is of the 
order of 510(30) for the sudden turn-on and 260(30) for the adiabatic ramp, which 
is less than 1 atom in a higher band per tube. The total number of atoms emitted 
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from higher bands changes with the pulse area because spurious emission from 
the higher bands is in the Markovian regime. 

Energy-shift data. The main motivation for the precise momentum calibration 
described above lies in the small size of the energy shift. Another challenge is the 
blurring of the distribution due to propagation effects for small coupling strengths 
(that is, long pulses). We use two measures for the determination of the energy 
of the emitted wavepackets: the squared separation of the wavepacket centres 
(extracted from the fit) and the second moment of the (centred) distribution. The 
accuracy of the peak-separation measure is limited by the fact that it ignores the 
physical broadening of the momentum distribution at larger coupling strengths 
and shorter times, whereas the second moment is sensitive to blurring of the wave- 
packets during detection. The data obtained using both methods are shown in 
Extended Data Fig. 3. In the non-Markovian regime, (2/A > 1, the peaks become 
indistinguishable (see the apex of the parabola in Fig. 2c) and a meaningful meas- 
ure of the shift cannot be extracted with either method. 

Numerical simulation of array effects. As stated in the main text, the Hamiltonian 
of the multi-site array* predicts the (resonant) transfer of excitations between 
neighbouring emitters®, in some similarity to the mechanisms for radiation trap- 
ping in an optically thick sample. Instead of attempting to solve this Hamiltonian, 
here we consider a simplistic model of an array of three emitters that is coupled to a 
quantized mode structure reflecting the weak longitudinal harmonic confinement. 
We chose our simplistic three-site model to have a centrally occupied site as the 
simplest spatially symmetric generalization of the isolated-emitter situation, which 
also corresponds to an ‘average segment’ in our system, with approximately one 
empty neighbour on each side of a populated site. 

We start from the Rabi Hamiltonian (in the rotating-wave approximation) and 
expand it to couple one or several sites ({r)) to many different, weakly confined 
levels ({b)). This Hamiltonian is (for simplicity only shown for two sites, but readily 
expanded to n sites) 


26; 0 2,4 2, 
: 0 26, 2D, QM,» 
A= a yy, My, —2A+w, 0 
—2A + 3w, 


Qy, Ls 0 


where 5,= mw; 17 /(2h) is a site-dependent detuning (that is, a site-dependent 
offset due to the weak harmonic confinement w, experienced by both 
lattice-trapped and free atoms) and +y,; represents overlaps between final- and 
initial-state wavefunctions (calculated numerically). We use modes up to a fixed 
frequency (Wmax= 20 x 5 kHz) and restrict ourselves to A + 202 < wmax- 

The results of the simulation for the Markovian parameters discussed in the 
main text are shown in Extended Data Fig. 4a, b. The simulated decay for an iso- 
lated emitter reproduces the prediction of our analytical model®°—small discrepan- 
cies arise from the fact that the latter neglects terms of order (A/w)? and (2/wy)” 


and higher, whereas our numerical simulation retains all orders. For the three-site 
array, the presence of neighbouring wells leads to a slowdown of the decay. The 
origin of this behaviour is reabsorption of emitted population by initially empty 
neighbours, as seen in Extended Data Fig. 4b. For the array considered, the process 
also leads to the formation of a temporary plateau in the overall site population, 
which may be related to the offset observed in the experiment. However, we cau- 
tion that the long-time decay occurs in a regime not accessible experimentally 
(we assumed a denser mode structure to extend the continuum approximation; 
see below), and also the dynamics may be different in an optical lattice extending 
over the entire mode volume. 

In Extended Data Fig. 4c we plot the dynamics in the extreme non-Markovian 
regime at the edge, where the coupling strength is much larger than the excited- 
state energy. In this case, we see that the single emitter displays oscillatory dynamics 
that quickly damps out and settles to a non-zero value, again in agreement with 
the analytical theory. On the other hand, the three-emitter array shows oscillatory 
dynamics of much greater amplitude and longer duration. We interpret this as a 
coherent enhancement of the dynamics through tunnelling to nearest neighbours. 
The range of this tunnelling diverges at the band edge, causing the marked differ- 
ence between single emitter and three emitters. 

From our simulations we gain additional insight into the effects of the quan- 
tized mode structure. As discussed in the main text, the mode structure should 
act like a true continuum for short enough times, where uncertainty should ‘wash 
out’ the levels. The simulations provide a quantitative test for this. The simplest 
comparison is for an isolated emitter in the Markovian limit. We see that for early 
times, the Markovian prediction® quantitatively agrees with the numerical solution, 
with a marked deviation (‘revival’) observable only at t" > 0.25[w,/(2)]~! (we 
restrict data taking to t< ft’ =2.6 ms in the experiments, in all but one case.) This 
is independent of the set harmonic trapping frequency and the location of the site 
in the array (centre or off-centre). We note that similar results are also obtained 
if the continuum is discretized by assuming a periodic-box-type potential, where 
the revival time depends on the length of the box. We furthermore note that our 
simulation does not reproduce 6, for long times. 

Data availability. The data that support the findings of this study are available 
from the corresponding author upon reasonable request. 


34. Pertot, D., Greif, D., Albert, S., Gadway, B. & Schneble, D. Versatile transporter 
apparatus for experiments with optically trapped Bose-Einstein condensates. 
J. Phys. B 42, 215305 (2009). 

35. Campbell, G. K. et al. Imaging the Mott insulator shells by using atomic clock 
shifts. Science 313, 649-652 (2006). 

36. Gadway, B., Pertot, D., Reimann, R., Cohen, M. G. & Schneble, D. Analysis of 
Kapitza-Dirac diffraction patterns beyond the Raman-Nath regime. Opt. 
Express 17, 19173-19180 (2009). 

37. Krinner, L., Stewart, M., Pazmifio, A. & Schneble, D. In situ magnetometry for 
experiments with atomic quantum gases. Rev. Sci. Instrum. 89, 013108 
(2018). 

38. Deng, L. et al. Temporal, matter-wave-dispersion Talbot effect. Phys. Rev. Lett. 83, 
5407-5411 (1999). 


© 2018 Springer Nature Limited. All rights reserved. 


LETTER 


Population 


-3 0 3 
A [kHz] 


Extended Data Fig. 1 | Average spectrum of coupling stationary atoms 
into the z lattice. The spectrum was generated from a series of 17 spectra 
taken over a one-day period, whose fitted centres are shifted to zero. The 
coupling strength is Q = 740(10) Hz and the pulse time is 400 j1s, with 
the data points binned into 300-Hz-wide bins. The solid curve is a fit to 
the data with 0 as a free-fitting parameter and the dashed curve has no 
free parameters. The effective coupling strength was calculated using the 
wavefunction overlap between free and trapped species, yo = 0.72. 
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Extended Data Fig. 2 | Raw momentum spectrum. The spectrum shows 
a detuning-independent, diffuse background of roughly 10? atoms. The 
spectrum was acquired as described in the main text, Fig. 2 and Methods; 
colour scale is identical to Fig. 2c. 
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Extended Data Fig. 3 | Raw data used to obtain the energy shift. 

a, b, Second moment of k (a) and half-separation squared (b) both 
subtracted by A/(27). The detuning A/(27) is 1.0 kHz (black disks), 

2.0 kHz (red triangles), 4.0 kHz (green squares) and 6.0 kHz (blue circles). 
Points in brackets correspond to the non-Markovian regime, Q/A > 1. 


© 2018 Springer Nature Limited. All rights reserved. 


LETTER 


I 1 4 b 


» 
= 


Population 


°o 


Population 


Evolution time [ms] 


Extended Data Fig. 4 | Simulated decay dynamics for a 1-site and a 
3-site model (with the central site initially populated). a, Dynamics 

of the two models, as depicted in the insets, for A= 27 x 1.9 kHz and 
Q=2n x 0.74 kHz, with w,= 27 x 0.1 kHz. b, Long-time decay dynamics 
of the 1-site (black) and 3-site (red) models for Q=27n x 0.74 kHz and 
A=2n xX 1.9 kHz, with w,=2n x 5 Hz. The dashed red line shows the 
population of the central, initially populated, site; the dotted red line shows 
the population of the neighbouring sites. c, Dynamics of the two models 
for A=—2n x 0.1 kHz and Q=2n x 3 kHz, with w,= 27 x 0.1 kHz. 
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Genetically encoded protein scaffolds often serve as templates for 
the mineralization of biocomposite materials with complex yet 
highly controlled structural features that span from nanometres 
to the macroscopic scale!“*, Methods developed to mimic these 
fabrication capabilities can produce synthetic materials with well 
defined micro- and macro-sized features, but extending control 
to the nanoscale remains challenging®. DNA nanotechnology 
can deliver a wide range of customized nanoscale two- and three- 
dimensional assemblies with controlled sizes and shapes”""!. But 
although DNA has been used to modulate the morphology of 
inorganic materials!”!3 and DNA nanostructures have served as 
moulds!*1> and templates!®!’, it remains challenging to exploit 
the potential of DNA nanostructures fully because they require 
high-ionic-strength solutions to maintain their structure, and this 
in turn gives rise to surface charging that suppresses the material 
deposition. Here we report that the Stéber method, widely used for 
producing silica (silicon dioxide) nanostructures, can be adjusted 
to overcome this difficulty: when synthesis conditions are such that 
mineral precursor molecules do not deposit directly but first form 
clusters, DNA-silica hybrid materials that faithfully replicate the 
complex geometric information of a wide range of different DNA 
origami scaffolds are readily obtained. We illustrate this approach 
using frame-like, curved and porous DNA nanostructures, with 
one-, two- and three-dimensional complex hierarchical architectures 
that range in size from 10 to 1,000 nanometres. We also show that 
after coating with an amorphous silica layer, the thickness of which 
can be tuned by adjusting the growth time, hybrid structures can be 
up to ten times tougher than the DNA template while maintaining 
flexibility. These findings establish our approach as a general 
method for creating biomimetic silica nanostructures. 

We illustrate our strategy using a DNA origami silicification (DOS) 
nanostructure with a pattern similar to the cell wall unit of diatoms 
(Supplementary Figs. 1 and 2), with Fig. 1a sketching the silicification 
process. First, we designed and constructed a DNA origami structure 
containing pores with theoretical inner diameters of 5.4 nm, 9.8 nm and 
82.4 nm, respectively (Supplementary Fig. 3). In contrast to the condi- 
tions under which previous double-stranded DNA (dsDNA)-guided 
silicification was performed!” , DNA origami nanostructures need 
a high concentration of cations such as Mg’* to maintain their struc- 
tural integrity®. But the high ion strength also prevents the attachment 
of N-trimethoxysilylpropyl-N,N,N-trimethylammonium chloride 
(TMAPS) onto the negatively charged phosphate backbone of the DNA 
that is needed for silicification (Supplementary Fig. 4). 

Molecular dynamics simulation revealed that hydrolysed clusters Tj» 
(where a and bare the numbers of TMAPS and tetraethyl orthosilicate 


(TEOS) molecules, respectively) with more than three TMAPS can 
overcome the electrostatic potential barrier and adsorb to the DNA 
backbone under high-ionic-strength conditions (Supplementary 
Fig. 5 and Supplementary Table 1). We therefore adapted the synthesis 
method to include a pre-hydrolysis step that promotes the formation 
of T,» clusters that are able to form an inductive layer around the DNA 
nanostructures, thus allowing for silica deposition. Optimized synthesis 
solutions had a TMAPS to TEOS stoichiometry set at 36.0 mM: 
90.0 mM (Supplementary Figs. 6-9 and Supplementary Table 2), which 
ensures that pre-folded DNA nanostructures can provide sufficient 
surface area and enough negative charge density to recruit T,, clusters 
before they self-aggregate with free TEOS in the bulk solution. This 
was confirmed by coarse-grained molecular dynamics simulations 
using a 198-base-pair (bp) three-dimensional DNA origami fragment 
along with a 348-bp two-dimensional DNA origami fragment that 
reacted with T,,» clusters (Fig. 1b and c, Supplementary Figs. 10-13 
and Supplementary Table 1). Our finding that the initial formation of 
small clusters is an essential step in the silicification process is in accord 
with earlier observations, which concluded that amorphous silica nano- 
particle synthesis involves colloidally stabilized primary particles that 
associate with and subsequently add to the silica structure’®. 

Figure 2 gives examples of DNA origami nanostructures of differ- 
ing complexity characterized by conventional transmission electron 
microscopy (TEM) and atomic force microscopy (AFM) before and 
after silicification. TEM imaging of simple DNA origami triangles in 
Fig. 2a reveals the successful formation of both the original DNA ori- 
gami templates and the DOS nanostructures, with measured average 
edge lengths of the triangular DNA origami frameworks and DOSs 
of 120.2 +2.3 nm (N=20) and 122.9+4.1 nm (N= 20), respectively. 
The average DNA origami edge width was measured as 22.6 + 1.5 nm 
(N=20) while the designed edge width and measured DOS edge widths 
were 26.0 nm and 25.9 + 1.4 nm, respectively, indicating edge shrinking 
for the stained pure DNA sample under TEM of around 3.4 nm. 
Such shrinking is often seen when DNA origami is dried in air on diverse 
substrates”! (see Supplementary Fig. 14, which also documents the 
height changes and dependence of changes on imaging mode and con- 
ditions). The heights and side lengths of the triangular DOS, based on 
AFM and TEM images, show the formation of a homogeneous thin silica 
coating layer with uniform thickness (mean value 3.1 nm for the z axis and 
2.7 nm for the x-y plane, Fig. 2b and Supplementary Figs. 15-18; see also 
Supplementary Fig. 19, with additional discussion on surface roughness). 
The sample composition was analysed by energy dispersive spectrometer 
(EDS) mapping, with an overlay of the obtained elemental mapping images 
with scanning electron microscope (SEM) images further confirming the 
uniform deposition of silica on the triangular DNA framework (Fig. 2c). 
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Fig. 1 | Schematic illustration and molecular dynamics simulation of 
DOS strategy. a, The reaction process of the DOS diatom-mimicking 
structure. The silicification was achieved through mixing DNA framework 
with prehydrolysed TMAPS and TEOS. The prehydrolysing cluster has 
multiple TMAPS molecules (noted as Ty», where a, b=0, 1, 2,3 ...). The 
configurations of the individual DNA double helices in the complex DNA 
origami templates could be identified as single- and multi-layered strands. 
In both cases, the silica could not be inserted into the spaces between the 
DNA helices, because the precursor molecule was much bigger than the 
gaps (about 5-10 A) between the two adjacent dsDNA molecules. Also, 
there was a high Mg’* density inside DNA nanostructures. b, Molecular 
dynamics simulation shows the DOS process whereby T,y clusters (for 
example, T,) adhere onto the DNA nanostructure surfaces. A 198-bp 


To demonstrate that our method retains the ability of DNA 
architecture-encoded nanofabrication for controlling sample 
morphology with nanometre resolution!*!», we built a set of DNA 
origami squares containing nanopores with sizes ranging, theoretically, 
from 31 nm x 44nm to 25 nm x 33nmand 13 nm x 11 nm (not taking 
the un-hybridized M13 fragment and flexibility of the DNA nano- 
structure into consideration; see Supplementary Figs. 20-22 for further 
design details). As illustrated in Fig. 2d, measured origami pore sizes 
were 639.9 nm’, 283.8 nm? and 103.6 nm? from class-averaged TEM 
images (Supplementary Figs. 23 and 24). After silicification, pore sizes 
changed to 608.8 nm?, 255.7 nm? and 87.5 nm? as determined from 
representative averaged TEM images (N= 339, Supplementary Fig. 25); 
these values correspond to diameters of 27.8 nm, 18.4 nm and 10.6 nm 
for a circular pore, and to side lengths of 24.7 nm, 16.0 nm and 9.4 nm 
for a square opening. 

Our method can in principle create single nanopores down to3 x 3nm?, 
which is based on the diameter of the original DNA helices and the 
thickness of the silica layer after growth. This is in fact illustrated by the 
diatom nanostructure, finally shown in Fig. 2e (see also Supplementary 
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three-dimensional DNA origami fragment was used as our model. Mg?* 
(green), DNA strands (blue), water box (light blue) and T,, clusters 
(orange) are shown. c, The simulation revealed that a minimal amount of 
T3 precursor (a > 3) is required to compete with the high concentration 
of cations (for example, 12.5 mM Mg?*) that was required by the DNA 
nanostructures. However, over-hydrolysed clusters such as T3g tend to 
self-aggregate before being recruited by the DNA origami. The normalized 
number of local nitrogen ions (that is, equal in concentration to the 
TMAPS, within an 8 A range of the DNA origami surface), is higher than 
the average number of nitrogen ions (36) found in the free solution that 
contains the DNA origami, which indicates the successful attachment of 
the clusters onto the DNA backbones. 


Fig. 26), with the negatively strained DNA nanostructure before silica 
deposition seen to have a less defined morphology and twisted outer 
edges. In contrast, the class-averaged TEM image (N= 20) reveals well 
replicated outer edges and two sets of inner pores with diameters of 
about 5.5 nm and 3.4 nm that agree well with the design (more details 
in Supplementary Figs. 27 and 28 and Supplementary Tables 3-5). The 
area of the smaller pore, about 12.0 nm?, is much smaller than the 
approximately 47.8 nm” previously reported for the smallest nanopore 
obtained by the DNA origami lithography method” and represents a 
resolution comparable to that achieved with electron beam methods”. 

Beyond accurately controlling the final silica shell thickness and 
pore size, our strategy provides a general and robust procedure for 
building rigid three-dimensional inorganic structures in arbitrary 
shapes that are difficult to create using lithography or polymer coat- 
ing!*15:17.21_ This is illustrated in Fig. 3a, which shows the basic designs 
and negatively stained DNA architectures (see also Supplementary 
Figs. 29-31) and Fig. 3b and c, which shows the corresponding DOS 
structures that include two-dimensional units (square, triangle, cross 
and diatom-mimicking), the two-dimensional honeycomb pattern and 
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Fig. 2 | Geometrically precise control of DOS structures. a, A triangular 
DOS nanostructure as a representative structure showed near-perfect 
shape replication from DNA origami templates to DOS nanostructures 
(schematic at top); the bottom left panel shows the negatively stained TEM 
image of the DNA origami templates; the bottom middle panel shows the 
TEM image of the DOS triangle; and the bottom right panel is the class- 
averaged TEM image of the DOS triangle. Scale bars, zoomed-out 200 nm 
and zoomed-in 50 nm. b, Statistics of the heights (from AFM) and lengths 
(from TEM) for DOS nanostructures with increasing growth time. Error 
bars, s.d. c, EDS mapping of the DNA-silica hybrid nanostructure 

(Si is coloured in orange, O in green and P in blue; SE, secondary-electron 
image). Scale bar, 50 nm. d, A user-specified DOS nanopore with the 


three-dimensional objects (cube, tetrahedron, hemisphere, toroid and 
ellipsoid). The parameters of each DOS structure measured from TEM 
images were compared with their original DNA framework design in 
Supplementary Table 6. See Supplementary Figs. 32-56, Supplementary 
Videos 1-3 and additional discussion for details on characterization, 
comparison to previous methods, growth kinetics and yields. 
Although the mechanical properties of DNA nanostructures are 
important in DNA-assisted assembly and nanofabrication””” and the 
tensile strength of DNA has been probed”**, little is known about the 
mechanical properties of DNA under pressure*”®. We found, using 
AFM Peak Force toolkit (see Supplementary Information, Methods) 
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smallest diameter down to less than 10 nm. From left to right, panels show 
the design, AFM images for pure DNA origami, the class-averaged TEM 
images, and the size distribution of silica nanopore measured from raw 
TEM images, respectively. Histograms of three different pore sizes were 
normalized and fitted to Gaussian distribution curves (light blue), and the 
red dashed lines indicate direct measurements of the pore sizes from class- 
averaged TEM images. e, DOS diatom nanostructures. From left to right, 
panels show the design model of a DNA origami template, AFM images 
before and after silicification, and the averaged TEM image of a DOS 
diatom nanostructure, respectively. Scale bars, 50 nm. All unmarked units 
refer to nanometres. 


measurements, a tenfold increase of the Young’s modulus (E modulus) 
from around 100 MPa for a DNA origami to around 1 GPa for the 
corresponding two-dimensional DOS triangle, and a greatly enhanced 
toughness that could withstand setoff forces up to 3,000 pN (Fig. 4a 
and b, Supplementary Figs. 57-64). Although the DOS modulus is below 
the values of 10-100 GPa reported for nanoporous silica”’, there might 
be scope for improving it. Figure 4c details the mechanical response 
of a three-dimensional DOS tetrahedron under compression, which 
was measured by Peak Force quantitative nanomechanical mapping in 
fluid mode and revealed an increase in the average peak heights of the 
samples from 16.5 nm, 22.9 nm and 39.7 nm to 52.1 nm (V=10nN 
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Fig. 3 | Representative complex DOS nanostructures. a, Designed 
models and corresponding TEM images of the DNA origami templates. 
From left to right, panels show the two-dimensional structures (square, 
triangle, cross and DOS diatom) and three-dimensional objects (cube, 
tetrahedron, hemisphere, toroid and ellipsoid). b, The corresponding 
DOS nanostructures. From top to bottom, panels show TEM images 


and 1.0 nN) after 0-, 1-, 2- and 5-day silica growth, respectively 
(Supplementary Figs. 65-67). Although the maximum measured height 
was below the theoretical height for a rigid ideal tetrahedron (74.0 nm), 
the results nonetheless illustrate that pure DNA origami nanostruc- 
tures placed on a mica surface deform dramatically and that DOS 
nanostructures have enhanced rigidity that increases with silica 
growth time. 

With the dsDNA tetrahedron reported to maintain its structural 
integrity*® up to a maximum compression force of about 100 pN, 
we next studied the load-deformation behaviour of a tetrahedron 
DOS and found no obvious shape changes for forces up to 1,000 pN 
(Supplementary Fig. 68). Even a 3.0 nN load force only bent the struc- 
ture without damaging it. As shown in Fig. 4d and Supplementary 
Fig. 69, the larger applied tip forces can lead to greater bending of the 
DOS tetrahedrons that had experienced 5 days of silica growth. 
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(zoomed-out and zoomed-in), SEM images and EDS mapping images 

(Si is coloured in red, O in green and P in blue; SE, secondary-electron 
image). c, Silicification of a honeycomb two-dimensional pattern. Left: 
negatively stained TEM image of DNA-only assemblies; right: TEM, SEM 
and EDS images of the DOS honeycomb. Scale bars, zoomed-out 100 nm 
and zoomed-in 50 nm. 


As force-deformation responses showed a nearly ideal elastic 
behaviour, we further explored the elastic properties of the DOS tetra- 
hedrons by applying force jumping cycles between 3.0 nN and 1.0 nN 
(Fig. 4e) and found that DOS tetrahedrons withstood at least 5 cycles 
and restored 80% of their initial height before being irreversibly dam- 
aged (Supplementary Fig. 70). The DOS nanostructures thus possess 
toughness and structural flexibility, that is, mechanical properties typ- 
ically found in biominerals”®. We attribute the enhanced rigidity of 
the DOS nanostructures (see also Supplementary Figs. 71-77) to the 
amorphous silica shell, with further structural support provided by 
water when in a liquid environment and by the bonds between the 
DNA molecule and the silicon—oxygen tetrahedron of the inorganic 
phase. In a final exploration of silicification-enhanced rigidity, we 
attached gold nanorods to two edges of the tetrahedra and found that 
the free-standing DOS-Au nanorods structure retains a distortion-free 
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Fig. 4 | Nanomechanical studies on DOS nanostructures. a, Young’s 
modulus (E modulus) derived from force curves, using a membrane 
substrate effect correction (MSEC) model. The E modulus of a DOS 
triangle after 5 days’ silicification was about ten times greater than the 
original DNA nanostructure. b, Comparison between a Derjaguin- 
Muller-Toporov (DMT) model and an MSEC model. “6max/height’ 
indicates that the maximum indentation depth was divided by the sample 
thickness; E/E.ample indicates the dispersion of the E modulus when 
taking substrate effects into consideration. The E modulus data that were 
deducted from the MSEC model were independent of thickness and force. 
c, AFM-tip-induced deformation of a DOS tetrahedron. The pure DNA 
tetrahedron was deposited on the mica substrate, with a maximum height 
of 15.2 nm. Samples aged for 1, 2 and 5 days showed increasing heights 

of 21.9 nm, 44.1 nm and 48.5 nm correspondingly and were able to stand 


tetrahedral geometry while the DNA-origami-Au nanorods structure 
collapsed (Fig. 4f, Supplementary Figs. 78-80, Supplementary Video 4). 
This suggests that DOS frameworks could in principle support metal 
nanoparticles in three-dimensional space with defined angles and cou- 
pling distances that might be of interest for developing nano-electronic 
and nano-photonic devices, with distortion-free nanoscale geometries 
much better than have been achieved in previously reported DNA- 
metal nanostructures’. 

In closing, we note that one long-standing aim of nanotechnol- 
ogy is full control over structural features with nanometre or even 


freely on the mica surface. The AFM tip force used for the measurement 
was 1.0 nN. The arrows indicate the height section directions. Scale 

bars, 50 nm. d, Height changes during the deformation of a single DOS 
tetrahedron that was grown for 5 days, under increasing tip forces, showed 
changes from 1.0 nN to 3.0 nN. Error bars, s.d. e, Left, schematics of the 
DOS tetrahedron spring. Centre, AFM images (tapping in fluid) of DOS 
tetrahedron after 5 days’ silicification. Cycling forces were applied to a 
single DOS tetrahedron and the recovery of structures further confirmed 
the unique elastic mechanical property of the DOS nanostructures. 

Scale bars, zoomed-out 200 nm and zoomed-in 50 nm. f, A free-standing 
DOS-Au nanorod tetrahedron that faithfully replicates the original design, 
as compared to a DNA origami tetrahedron supported nanorod structure, 
which collapsed onto the surface owing to its flexibility. Left, SEM image; 
centre, STEM image; and far right, TEM image. Scale bars, 50 nm. 


atomic-level precision. DNA has been much studied in that context as it 
can be readily programmed to assemble into well defined customized 
nanostructures of considerable complexity. The method we have pre- 
sented here offers the possibility of transferring nanoscale geometric 
information from designer DNA nanostructures (especially frame- 
work nucleic acids) to inorganic materials, illustrated by the successful 
formation of silica-DNA composites with a wide range of different 
shapes and inorganic coating thicknesses that control the rigidity of 
the structures. Provided suitable synthesis conditions can be found, 
it should in principle be possible to create composites where other 
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inorganic materials coat DNA origami shapes; or even multi-compo- 
nent composites, through stepwise deposition of more than one inor- 
ganic material on DNA templates?!*??°, 


Data availability 

All data generated or analysed during this study are included in the paper and 
its Supplementary Information, and are available from the corresponding author 
on request. 
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soil depth 
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The exchange of carbon between soil organic carbon (SOC) and the 
atmosphere affects the climate’? and—because of the importance 
of organic matter to soil fertility—agricultural productivity’. The 
dynamics of topsoil carbon has been relatively well quantified*, 
but half of the soil carbon is located in deeper soil layers (below 
30 centimetres)*-’, and many questions remain regarding the 
exchange of this deep carbon with the atmosphere®. This knowledge 
gap restricts soil carbon management policies and limits global 
carbon models!»!". Here we quantify the recent incorporation of 
atmosphere-derived carbon atoms into whole-soil profiles, through 
a meta-analysis of changes in stable carbon isotope signatures at 112 
grassland, forest and cropland sites, across different climatic zones, 
from 1965 to 2015. We find, in agreement with previous work>®, 
that soil at a depth of 30-100 centimetres beneath the surface (the 
subsoil) contains on average 47 per cent of the topmost metre’s SOC 
stocks. However, we show that this subsoil accounts for just 19 per 
cent of the SOC that has been recently incorporated (within the 
past 50 years) into the topmost metre. Globally, the median depth 
of recent carbon incorporation into mineral soil is 10 centimetres. 
Variations in the relative allocation of carbon to deep soil layers 
are better explained by the aridity index than by mean annual 
temperature. Land use for crops reduces the incorporation of carbon 
into the soil surface layer, but not into deeper layers. Our results 
suggest that SOC dynamics and its responses to climatic control or 
land use are strongly dependent on soil depth. We propose that using 
multilayer soil modules in global carbon models, tested with our 
data, could help to improve our understanding of soil-atmosphere 
carbon exchange. 

The size of the Earth SOC reservoir is estimated to be around 
1,500 gigatonnes of carbon (Gt C) in the first metre, excluding perma- 
frost areas°, making it a huge potential source or sink for atmospheric 
carbon (levels of which are increasing by +4.4 Gt C per year)!'. The 
future response of this soil compartment could substantially affect not 
only the climate but also global food production (through the role of 
organic matter in soil fertility), as well as the stability or resilience of 
ecosystems’. About half of this carbon is located at depths below 30 cm 
(refs *°). However, although the dynamics of topsoil carbon has been 
relatively well quantified, especially thanks to long-term experiments 
carried out over generations’, major questions remain about how 
to estimate changes in deep-soil carbon and the processes involved. 
Decision-makers and ecosystem managers are thus deprived of any 
references for the management of the deep carbon stock. Similarly, 
when modelling the Earth system and the global carbon cycle, the 
scientific community also constantly faces the problem of modelling 
the dynamics of deep carbon?!®!, 

Neither absolute changes in carbon stocks nor carbon fluxes in the 
deep soil horizons can be quantified by direct measurement. Owing 
to the very low carbon concentrations (on average less than 5 g kg! 
at depths of 80 cm), spatial heterogeneity and slow changes, temporal 
variations in stocks are smaller than measurement accuracy. Evidence 


for deep carbon changes is therefore exceptional!*"*. Information on 
incoming fluxes resulting from root mortality and exudation by living 
roots is not accessible without tracers. In addition, in situ quantifica- 
tion of the outflow from the organic reservoir—which occurs mainly 
through heterotrophic respiration by organic-matter decomposers, 
in the form of CO, production—is very difficult, if not impossible, 
because the CO) efflux blends heterotrophic respiration and root auto- 
trophic respiration!>. Isotopic methods are therefore appropriate for 
tracing deep carbon dynamics. The radiocarbon age of deep carbon 
is indicative of its slow turnover!*!°, but 4C dating, which provides 
mean ages, does not estimate the exact proportions of active and stable 
carbon!”°. Here we propose a stable-isotope-based observation of the 
actual depth distribution of soil carbon ages. It relies on sites that are 
marked by a natural change in the '°C/'*C ratio of the vegetation at a 
known date. This is equivalent to the continuous in situ labelling of 
the atmospheric carbon atoms that have been incorporated into soil 
organic matter for a known duration, have eventually replaced pre- 
existing organic carbon, and have been retrieved at the date of 
sampling". 

We conducted a meta-analysis of 112 such sites (Extended Data 
Fig. 1), where the labelling ranged from 4 to 4,000 years. At each site, 
the technique provides an indication of carbon age—that is, the pro- 
portion of carbon that is younger than the labelling duration; meta- 
analysis of similar sites with varied durations provides an age prob- 
ability distribution!”. Our study includes most of the world’s biomes 
except boreal zones, and is evenly distributed among forests, grasslands 
and croplands. 

We quantified carbon distribution in the two-dimensional age-depth 
continuum”’, the depth distribution of carbon incorporation in soil 
over the past 50 years, and the dependence of these factors on climate 
and land use. We also summarized depth distributions in terms of two 
layers, 0-30 cm (topsoil) and 30-100 cm (subsoil)—an arbitrary cut- 
off, but one that is often used in carbon inventories®. Our results, which 
are based on original observations, are independent of any datasets or 
modelling results from other studies. 

Figure 1 depicts individual data showing the proportion of new 
carbon—that is, the proportion of SOC that derives from new vegetation— 
as a function of time. At all depths, a minor proportion of soil carbon 
is renewed rapidly (within ten years). Nine sites at which a vegetation 
signature change occurred more than 1,000 years ago reveal the incom- 
plete replacement of carbon, that is, the presence of millennia-old soil 
carbon, at depth but not in the topsoil. 

The rate of carbon incorporation in the topsoil was, as expected, 
strongly dependent on environmental variables, in particular land use 
(P < 0.001) and mean annual temperature (MAT; P < 0.05) (Extended 
Data Table 1). For the subsoil, by contrast, we found no relationship 
between carbon age and land use, and only a weak relationship with 
temperature (P = 0.1); instead, carbon age was more affected by the ratio 
of precipitation to potential evapotranspiration” (P < 0.01; Extended 
Data Table 2). This observation reinforces the results of ref. ?, which 
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Fig. 1 | Observed proportions of new carbon in 112 soil profiles. For 
each soil sample, the proportion p of new carbon atoms was determined 
by the change in the soil carbon /%C signature following a change in the 
SC signature of the vegetation for a given duration f; p is the proportion 
of carbon atoms younger than #”!. Data are presented in four classes of 
duration t. 


showed that the relationship between ecosystem carbon turnover time 
and precipitation is pervasive and underestimated by models. 

To analyse the age distribution with depth under comparable envi- 
ronmental conditions, we selected a homogeneous subset of sites, 
namely a group of forests and grasslands under warm and moist 
climates (with MATs higher than 17°C, annual precipitation of more 
than 1,000 mm, and precipitation/evapotranspiration ratios greater 
than 0.8). Figure 2 and Extended Data Table 3 depict the detailed 
depth distribution of carbon ages throughout this panel of soils. This 
description of carbon dynamics in time-depth space highlights its 
strong dependence on both variables. The dynamics of subsoil carbon 
is around seven times slower than that of topsoil carbon (that is, it takes 
seven times longer to reach the same proportion of renewed carbon; 
Extended Data Fig. 2). In deep layers, the age distribution reveals the 
small but non-negligible direct incorporation of photosynthetically 
fixed carbon through deep roots or soluble carbon (for the youngest 
carbon), and the predominance of carbon that is older than 1,000 years. 
Mid-profile horizons (20-70 cm) are dominated by carbon of inter- 
mediate ages (100 to 1,000 years), which can be considered to result 
from the slow downward movement of carbon!®*, Carbon incorpora- 
tion in the 100-200-cm layer has been quantified in only a few studies 
and averaged 5 + 3% (1 standard deviation) of soil carbon after 50 years. 

We calculated the amount of carbon incorporated into each layer 
(Crew in units of kg C m~’) for each site. In our database, the SOC 
found in the subsoil layer represents 47% of the total stock found in 
the entire top metre of soil, in agreement with the percentage of 47% to 
52% reported globally*®. To express the incorporation of new carbon in 
depth on the basis of a single indicator, we chose the ratio R39-190, which 
is Crew(30-100 cm)/Chew(0-100 cm), and analysed its dependence on 
land use, climate and time in the 0- to 200-year-old sites (Extended 
Data Table 4). We found that R39-19 is strongly dependent on land use 
(P < 0.001). The mean values of R39-199 (50 years) are 19%, 21% and 
29% for forests, grasslands and croplands, respectively. The relatively 
deeper carbon incorporation in croplands concerns all layers below 
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Fig. 2 | Meta-analysis of carbon age distribution over 55 tropical 
grassland and forest soil profiles. At each depth, the proportion of 
carbon aged less than time t (3 years, 10 years and so on) was fitted by 

a bi-exponential regression of t (Extended Data Table 3). Grey bands 
represent +1 standard error of the estimated mean. The median age of 
soil carbon increases from seven years at depth 0 cm to 1,250 years at 
100 cm. Integration of the carbon content in each layer demonstrates 
that the carbon of age less than 50 years represents 45% of topsoil carbon 
(0-30 cm) and 13% of deep carbon (30-100 cm). 


a depth of 10 cm and cannot be explained only by soil mixing due to 
ploughing, as the depth of this mixing does not exceed 30 cm (Fig. 3). 
Croplands incorporate less new carbon in their topsoils than do forests 
and grasslands, whereas in subsoil the amount of incorporated carbon 
is similar (Extended Data Tables 1 and 2). This is consistent with the 
general reduction in carbon input at the soil surface”® that results from 
the removal of above-ground biomass during harvesting. R39-190 also 
depends on the precipitation/evapotranspiration index (P < 0.005), 
and is weakly dependent on MAT (P < 0.1; Extended Data Table 4), 
in accordance with the deeper rooting that takes place under dry 
climates, and possibly the more frequent occurrence of deep soils at low 
latitudes. The world average value of R30-100 (50 years) is 19% (14%; 
95% confidence interval) (Fig. 3). The overall shallow incorporation of 
carbon can be expressed by the median depth of carbon incorporated 
in the last 50 years: 9 + 1 cm (95% confidence interval) in forests, 
10 + 2 cm in grasslands and 17.5 + 1.5 cm in croplands in our panel 
(9.7 + 1.2 cm on average globally; Extended Data Table 5). Taking 
into account the 100-200-cm layer (when observed) would lower this 
median depth by 0.5 cm. 

This study provides an unprecedented estimate of, first, the SOC age 
distribution over the soil profile (Fig. 2), and second, the depth distribu- 
tion of the carbon transferred from the atmosphere to soils (Fig. 3). The 
carbon-incorporation profiles can be compared with existing profiles of 
root biomass and above-ground inputs. The proportion of carbon that 
we found to be allocated to the subsoil is higher than the corresponding 
proportion of root biomass compiled in meta-analyses**°. This can 
be explained on the one hand by the contribution of root exudates in 
addition to root mortality’’, and on the other hand by reduced decay 
rates at depth”4. The reduced decay rates could be related to several 
interacting processes, for example, reduced and scattered microbial 
biomass’, stabilization by minerals*'’, and a reduced priming effect 
(the latter being the stimulation of SOC decomposition by the supply 
of fresh carbon)”*. 
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Fig. 3 | Depth distribution of the carbon that has been transferred from 
the atmosphere to soil organic matter between 1965 and 2015. The 
amount of carbon per 10-cm increment is expressed as a proportion of the 
total carbon incorporated in the top metre. The value for each land use is 
the mean of the observed profiles, and the value for the whole Earth was 
estimated by multivariate linear model extrapolation to the world’s biomes. 
Error bars represent the 95% confidence interval of the mean or estimate; 
within each increment, land uses followed by the same letter (a or b) do 
not differ significantly. The small shift between the global estimate and the 
observed values reflects the differences in soil-climate conditions between 
the global average and the observation panel. 


We measured the depth distribution of atmosphere-derived carbon 
incorporation over the past 50 years (50-year input). The depth 
distribution of the net change in soil carbon in the same time interval 
also depends on the loss of carbon older than 50 years during the 
period (50-year output). In steady-state systems, the depth distribu- 
tion of outputs would perfectly equal the depth distribution of inputs. 
But real systems are transient as a result of global changes in either 
carbon inputs (for example, increased net primary production, reduced 
carbon returns because of land-use change) or decay rates (for example, 
because of warming). On the basis of our meta-analysis, we argue that 
the depth distributions of carbon output and of carbon incorporation 
are very similar even in transient systems, for the following reason. 
In non-steady-state systems, the delay associated with the downward 
movement of carbon may be suspected to result in 50-year outputs that 
are deeper than 50-year inputs, in a ‘conveyor-like’ dynamic system. 
But the R3o.199 ratio increases very slowly with time (by less than 0.001 
per year; Extended Data Table 4). This means that the movement of 
carbon is slow and affects only long-term carbon dynamics, far later 
than the change expected in future decades. The depth distribution 
of net changes could differ from our distribution of new carbon only 
under the pressure of a driving force that affects old carbon in a very 
different way to the new carbon, such as de-freezing”? or major changes 
in deep carbon inputs leading to additional priming effects”®. 

Our study also reveals that the steep age gradient with depth (Fig. 2) 
could be a source of bias in the representation of carbon dynamics 
if depth is not handled properly. For instance, if we consider three 
commonly used reference layers—0-10 cm, 0-20 cm and 0-30 cm—we 
find that their median ages differ considerably, being 23, 50 and 92 
years, respectively. Projecting the decay-rate parameters observed in the 
topmost part of soils onto thicker layers would bias future projections of 
changes in carbon. The kinetics of carbon incorporation further reveals 
a substantial turnover over the time range of centuries (Figs. 1, 2 and 
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Extended Data Fig. 2)—that is, between the ‘decadal’ and ‘millennial’ 
compartments of present carbon models’”*°—arguing for a more real- 
istic description of carbon storage in terms of continuous time ranges*”. 

Our results show that SOC dynamics and their responses to climatic 
control or land use are strongly depth dependent. A better representa- 
tion of deep carbon dynamics has been called for, together with other 
processes, to improve ecosystem carbon models”'”!°. Our observations 
support the use of multilayer SOC modules in Earth system models, 
which our data could help to test. 
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METHODS 


Study sites. We compiled published datasets from 47 peer-reviewed arti- 
cles, together with original data, on mineral soil '*C/!?C changes in places 
where the '°C/C ratio of the vegetation has been shifted for known durations 
(see Supplementary Information). We analysed a total of 112 pairs of mineral soil 
profiles: 108 pairs in which the predominant vegetation has changed from the 
C3 photosynthetic type to the C4 type, or vice versa, and four pairs from free-air 
carbon-enrichment (FACE) experiments, where the °C signature of added carbon 
dioxide has labelled plant-derived material (Extended Data Fig. 1; see references 
in the Supplementary Information). At each site, two plots with a common history 
(one with changed and one with unchanged vegetation) were analysed. The isotopic 
difference between the two profiles was used to calculate the proportion of new 
carbon through an isotope-mixing equation, which is not biased by additional 
isotopic effects in soils”!. 

Most of the world’s biomes are represented; the land uses include grasslands 
and savannahs (34%), forests and woodlands (30%), and annual and perennial 
crops (36%), from 24 countries between latitudes 29° S and 57° N (Extended Data 
Fig. 1). We selected studies that fulfil the following criteria: the age of change should 
be known or estimated; the observed depth should be more than 60 cm or reach 
bedrock; and the difference in the §°C of the vegetation between the reference 
and the study site should be 5%o or more in the case of mixed vegetation that 
includes both photosynthetic types. The duration of vegetation change ranged from 
4 years to 4,000 years. Authors estimated the dates of change through controlled 
experiments, enquiries, historical records, or airborne surveys. Changes in isotope 
signature that occurred more than 1,000 years ago (nine sites) were associated with 
interacting climate- and man-induced changes in vegetation. In those cases, dates 
were estimated by the authors from local or regional proxies of palaeovegetation 
change (for example, pollen/charcoal combined with radiocarbon dating). When 
the period after vegetation change was expressed by the authors as a range (for 
changes older than 200 years), we used the mid-value of the range. 

Mean climatic data were obtained either from data reported in the article 
(n= 103) or, if missing (n = 9), from the CRU Group/Oxford/International Water 
Management Institute (IWMI) 10-minute mean climate grids for global land areas 
for the period 1961 to 1990 (ref. *!). We compared grid versus declared climatic 
data in the database: for annual precipitation, the mean CRU grid/declared ratio 
is 0.98 + 0.15 (standard deviation); for MAT, the mean difference between CRU 
grid and declaration is —0.15 + 1.1°C. Topsoil clay content was either obtained 
from authors’ statements or assumed to be the median value of the texture class 
mentioned. Mean annual aridity indexes, P/PET (annual precipitation/potential 
evapotranspiration)”>—a better indicator of hydric impact on both net primary 
production and microbial activity than precipitation alone—were obtained from 
the Food and Agriculture Organization 10-minute mean climate grids for global 
land areas for the period 1950 to 2000 (ref. 7°). 
Proportion of new carbon and data pre-processing. For each site, the natural 
13C-labelling technique uses two plots, which were initially identical and have 
become differentiated during the last t years by two types of vegetation that differ 
in their §'3C. We use the terms ‘reference’ (‘ref’) for the plot at which the vege- 
tation type at the date of sampling is the closest to that of the initial vegetation, 
and ‘studied plot’ (‘s’) for the plot with the new type of vegetation. Most authors 
described carbon content and isotopic data profiles as successive layers, each one 
sampled between two depths (z}, 2). For each layer (z, Z2), we define C as the 
carbon stock in the horizon (in kg m~7); fnew as the proportion of new carbon 
(that is, derived from the new vegetation) (Fig. 1); and Cyew as the stock of new 
carbon in the horizon (in kg m’).C, fnew and Chew were either obtained from 
the authors’ papers (n = 30), or calculated from observed variables as follows. 
C was calculated from carbon concentration, [C] (in mg g'), and bulk density, p, 
according to C= [C] x p x (2. — 2), where p was either from the authors’ data or 
(in 41 cases) estimated from [C] according to Alexander’s” equation. fand Chew 
were calculated according to the equations”!: 


Fo, = (Ss0il, —8s0il,<,) /Adveg () 


Crew =Srow XC 


where 6soil, and Ssoil,er are the §'°C values of SOC from the study and reference 
plots; and Aveg is the difference in vegetation 6'3C between the study and ref- 
erence plots and was determined from plant or litter samples. The dsoil;e in each 
horizon was obtained from the reference soil collected at the same depth as the 
soil of the study plot. In accordance with the limit of resolution of the method, 27 
horizons in deep layers had negative fnew Values; in this case, we considered Chew to 
be 0. The resulting overestimation of average new carbon was negligible. In cases 
in which the sampling depth differed at the reference and studied plots, we calcu- 
lated soil, by linear interpolation of the two nearest observed depths. Equation 
(1) typically accounts for the various '°C enrichments that occur during organic 
carbon decay or historical changes”', with the sole criterion that these enrichments 
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are similar in the study and reference soils. Equation (1) neglects the dark fixation 
of carbon atoms* that would have the isotopic composition of atmospheric CO>. 
Depth distribution of new carbon. We calculated depth distributions for the sub- 
set of sites whose labelling duration was 200 years or less (n = 99; Fig. 3). The mean 
duration was 35 years. In order to compare similar depth intervals, we calculated 
the three variables fnew, cumulative carbon stock with depth C(0, z) and Cpew(0, Z) 
10-cm increments by linear interpolation of the observed horizons. For each 10-cm 
depth interval (z, z+ 10 cm), we computed the ratio R = Cyew(z, + 10 cm)/Crew(0, 
100 cm). When bedrock or the R horizon was described, a nil carbon content was 
attributed to these horizons. When profiles were not described down to a depth 
of one metre ( = 31; most often 80 cm), C(0, 100 cm) was extrapolated from the 
maximum depth Zmax using the linear regression C(0, 100 cm) = a x C(0, Zmax) + 8 
over the entire dataset and similarly for Cyew(0, 100 cm). 

The median depth Zmedian of new carbon was calculated for individual profiles 
aS Crew(0; Zmedian) = Cnew(0, 100 cm)/2, by linear interpolation in the observed 
Crew(0, Z) function. 

The variance of the ten ratios R = Chew(z, z+ 10 cm)/Cyew(0, 100 cm) at the ten 
depths z = 0, 10,..., 90 cm, the ratio for the whole subsoil Cyew(30, 100 cm)/Crew(0, 
100 cm) and the median depth of new carbon were analysed by multivariate linear 
regression of time, land use and climatic variables (Extended Data Tables 4 and 
5). Given that the average start date of labelling was 1965, we consider that the 
regression value of R for time = 50 years stands for carbon incorporated in the 
time interval 1965-2015. World average values of carbon incorporation in deep 
soil layers, excluding permafrost areas, were obtained by weighting multivariate 
linear regression estimates of new carbon (Extended Data Tables 1, 2, 4 and 5) by 
the biome soil carbon inventories in ref. °. Multivariate linear regression used the 
mean value of each of the 112 observed profiles, with no weighting for the number 
of replicates or horizons, leading to less precise but unbiased estimation. When 
replicated, profile variability is provided in the database in the Supplementary 
Information. We used bootstrap procedures™ to express confidence on the esti- 
mated depth distribution or median age for the globe (Fig. 3 and Extended Data 
Table 6), or on the depth distribution of ages in tropical grasslands and forests 
(Fig. 2 and Extended Data Table 3). For that purpose, we drew N = 100,000 inde- 
pendent profile bootstrap samples from the observed profiles. For each bootstrap 
sample, relationships with P/PET, MAT, land use and time were recomputed and 
used to calculate the values of the variables of interest. Standard deviations were 
then estimated as the standard deviation of these 100,000 values. 

Statistical analyses were performed using the R packages Boot and Stats version 

3.4.3, 
Analysis of the inference of vegetation change on the results. The naturally 
labelled sites experienced varying degrees of perturbation compared with pris- 
tine ecosystems. Vegetation change may modify input or decay rates, leading to 
transient carbon dynamics. To investigate whether these changes themselves affect 
the depth distribution of new carbon, we tested the dependence on two additional 
variables that characterize the observed sites: the previous type of vegetation— 
either crops, grassland or forest, known for 109 sites—and the relative difference 
in carbon stock between study and reference plots, when known and when the 
reference resembled the previous vegetation type (n = 88 sites). The relative change 
AC,e is calculated as: 

ACra = [C(0, 100 CM) studied site — c(0, 100 CM) reference site]/C(0, 100 CM) reference site 
AC, is nil on average in the database, that is, it corresponds to the steady state 
(AC,e1 = 0.004 + 0.026, + s.e.m.); however, it does vary as a result of changes in inputs 
or dynamics in different directions. Mean durations of change are independent 
of previous vegetation in the statistical analysis: 31 years for previous grassland, 
37 years for previous crops and 40 years for previous forests (excluding durations 
of more than 1,000 years, which involve no crop). AC;« is not correlated with the 
duration of the change either. 

Concerning the depth distribution of new carbon, that is, R30-100 = Cnew(30 to 
100 cm)/Cyew(0 to 100 cm), R3o-199 is not correlated with A C,.) either in the whole 
dataset (r? = 0.002; n = 88), or within the subsets of crops (7? = 0.01; n = 31), 
grasslands (1° = 0.02; n = 24) or forests (7° = 0.13; n = 33). We also tested the pre- 
vious vegetation type as an explanatory variable of R3o-190 in addition to the other 
variables of climate, present land use and time (that is, the variables in Extended 
Data Table 4). The additional variable was not a significant factor (previous forest 
versus previous crop: P = 0.88; previous grassland versus previous crop: P = 0.52; 
previous grass versus previous forest: P = 0.47) and did not improve the model. 

Concerning the proportion of new carbon in either topsoil or subsoil (that is, 
fnew)» the previous vegetation type added as an explanatory variable in the statis- 
tical models of Extended Data Tables 1 and 2 was not a significant factors either 
(P = 0.49 to 0.99). By contrast, as an additional variable, AC, was highly signifi- 
cant for topsoil (P < 0.01) but was not for subsoil (P = 0.12). The effect is obvious 
given that both carbon change and new carbon are first driven by the relative 
change in inputs. This effect typically explains one of the results, namely the lower 
proportion of new carbon in cropland topsoils (Extended Data Table 1). 
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Concerning the age distribution in the subset of tropical grasslands and forests 
(Fig. 2 and Extended Data Table 3), the mean value of AC;« is very low (0.02 + 0.03; 
+ s.e.m.), close to steady state, and AC,.| does not depend on time, and therefore 
does not affect the estimated mean age distribution, but does contribute to the 
random dispersion of results. 

Finally, we included neither previous vegetation as an explanatory variable in the 
statistical models of the proportion of new carbon, nor carbon change, because of 
the covariance of AC,.) with land use. Furthermore, sites with previous or present 
croplands may have experienced a complex land-use history involving ancient pri- 
mary forests and possibly pasture events. Taking all land-use histories into account 
would become a case-by-case study. 

On the basis of this analysis of the inference of vegetation changes, we conclude 
that perturbation did not bias our estimates of the mean depth distribution of 
new carbon; that is, this depth distribution depends on the present vegetation 
and conditions, and not on previous vegetation, nor is it affected by non-steady- 
state conditions, in any systematic direction. The impact of perturbation on the 


proportion of new carbon in topsoils nevertheless prevented us from integrating 
our data towards global estimates of the absolute amount of new carbon or global 
carbon turnover. We thus restricted global integration to the depth distribution 
and median depth of new carbon. 

Data availability. The raw primary data, calculated data and ancillary information 
analysed and generated here are available in the INRA public repository (https:// 
doi.org/10.15454/KMNR@R). No statistical methods were used to predetermine 
sample size. 
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Extended Data Fig. 1 | Locations of the study sites. Source of background image: Visible Earth, NASA. 
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Extended Data Fig. 2 | Kinetics of new-carbon incorporation 

for the depth layers 0-30 cm and 30-100 cm. The respective 
logarithmic regressions y = 0.30 x logyo(x) — 0.07 for 0-30 cm and 
y = 0.26 x logio(x) — 0.23 for 30—100 cm indicate that the duration 
required to replace one-third of the carbon is on average seven times 
longer in the subsoil than the topsoil. 
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Extended Data Table 1 | Proportion of new carbon in topsoil: multivariate linear regression 


Coefficient Standard Tvalue Pr(>|7]) 


estimate error 

Intercept = Forest -0.00250 0.0728 -0.034 0.973 
Grassland -0.0531 0.0405 -1.311 0.193 
Cropland -0.0951 0.0328 -2.896 0.00472 = 
Log10(time) 0.258 0.038 6.865 7.6e-10 a 
MAT 0.00595 0.00234 2.540 0.0128 . 
P/PET -0.0441 0.0337 -1.306 0.1954 

Clay -0.000566 0.000693 -0.816 0.417 


The dependent variable is the ratio of new carbon (derived from the vegetation after time t) to total organic carbon in the topsoil layer. The explanatory variables are land use (grassland or cropland), 


logio(t) (in years), mean annual temperature (MAT, in °C), ratio of annual precipitation to evapotranspiration (P/PET), and topsoil clay content (as a percentage). The reference land use (intercept) is 
forest. T is the value of Student's statistics; Pr(>|T|) is the probability value of the Student's t-test. 
*P <0.05; **P < 0,01; #**P < 0.001. 


Residual standard error, 0.1249 on 92 degrees of freedom; multiple R?, 0.4781; adjusted R?, 0.444; F-statistic, 14.04 on 6 and 92 degrees of freedom; P-value, 2.768 x 10 AL 
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Extended Data Table 2 | Proportion of new carbon in subsoil: multivariate linear regression 


Coefficient Standard Tvalue Pr(>|7]) 


estimate error 

Intercept = Forest 0.0205 0.0433 0.474 0.637 
Grassland -0.0136 0.0241 -0.564 0.574 
Cropland 0.0024 0.0195 0.121 0.904 
Log10(time) 0.0849 0.0223 3.806 0.00025 *** 
MAT 0.00216 0.00139 1.551 0.124 

P/PET -0.0516 0.0201 -2.570 0.0118 . 
Clay 0.000018 0.000412 0.045 0.965 


The dependent variable is the ratio of new carbon (derived from the vegetation after time t) to total organic carbon in the subsoil layer. See Extended Data Table 1 for further definitions. Residual stand- 
ard error, 0.07424 on 92 degrees of freedom; multiple R?, 0.2561; adjusted R2, 0.2076; F-statistic, 5.279 on 6 and 92 degrees of freedom; P-value, 0.0001028. 
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Extended Data Table 3 | Age distribution of carbon over 55 tropical grassland and forest soil profiles 


Depth at ki (yr") a2 ka (yr') at + a2 Standard deviation 
of residuals 

0 cm 0.614 0.21 0.34 0.0073 0.95 0.18 
(0.47, 0.75) (0.11, 2.8) (0.2, 0.49) (0.0024, 0.0158) (0.92, 1.0) 

10cm 0.287 0.15 0.67 0.0059 0.96 0.11 
(0.19, 0.40) (0.08, 0.5) (0.56, 0.77) (0.0028, 0.0083) (0.94, 1.0) 

20cm 0.108 0.23 0.85 0.0047 0.95 0.09 
(0.05, 0.19) (0.09, 1.5) (0.78, 0.92) (0.0025, 0.0072) (0.92, 1.0) 

30cm 0.074 0.24 0.86 0.0035 0.93 0.10 
(0.02, 0.13) (0.13, 1.5) (0.8, 0.93) (0.0020, 0.0064) (0.88, 1.0) 

40cm 0.070 0.23 0.83 0.0026 0.90 0.10 
(0.03, 0.13) (0.09, 1.0) (0.74, 0.93) (0.0014, 0.0041) (0.83, 1.0) 

50cm 0.066 0.21 0.80 0.0020 0.87 0.09 
(0.04, 0.11) (0.10, 0.5) (0.70, 0.95) (0.0010, 0.0030) (0.79, 1.0) 

60 cm 0.065 0.20 0.75 0.0018 0.82 0.10 
(0.03, 0.11) (0.10, 0.4) (0.65, 0.89) (0.0008, 0.0028) (0.73, 0.94) 

70cm 0.052 0.22 0.71 0.0016 0.76 0.10 
(0.02, 0.09) (0.13, 0.5) (0.62, 0.88) (0.0007, 0.0024) (0.67, 0.95) 

80cm 0.044 0.25 0.65 0.0016 0.70 0.11 
(0.01, 0.08) (0.15, 3.2) (0.54, 0.92) (0.0005, 0.0024) (0.60, 0.93) 

90 cm 0.042 0.25 0.60 0.0016 0.64 0.11 
(0.01, 0.08) (0.14, 3.5) (0.46, 0.99) (0.0003, 0.0025) (0.51, 1.0) 

100 cm 0.048 0.25 0.55 0.0014 0.60 0.11 
(0.01, 0.08) (0.14, 3.3) (0.44, 0.99) (0.0002, 0.0025) (0.48, 1.0) 


These data were used to generate Fig. 2. At each depth, the proportion fnew of carbon aged less than t was fitted by a nonlinear regression of time t using the equation fnew = a1.[1 — exp(—ki x ] + 
a.[1 — exp(—ko x t)]. Such bi-exponential functions®° describe carbon age distribution, with carbon divided into three age classes, a; being the proportion of ‘young’ carbon, az the proportion of ‘old’ 
carbon, and (1 — a; — az) the proportion of carbon with an infinite age. 1/k; and 1/k2 are the mean ages of young and old carbon, respectively. Numbers in parentheses denote the 95% confidence 
intervals of the estimated parameters. The median environmental conditions of the soil set are: MAT = 23.6 °C; annual precipitation = 2,100 mm; P/PET (ref. 23) = 1.44 and topsoil clay content = 37%. 
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Extended Data Table 4 | Depth incorporation of new carbon in subsoil: multivariate linear regression 


Coefficient Standard Tvalue Pr(>| 71) 


estimate error 
Intercept = Forest 0.193 0.049 3.979 0.00014 e 
Grassland 0.0218 0.03499 0.624 0.534 
Cropland 0.105 0.029 3.646 0.00044 a 
Time (yr) 0.000369 0.000320 1.155 0.251 
MAT (°C) 0.00381 0.00206 1.848 0.0677 
P/PET -0.0996 0.0297 -3.354 0.00116 oe 
Clay (%) 0.000638 0.000608 1.049 0.297 


The dependent variable is the ratio of the amount of new carbon (derived from the vegetation after time t, in kg m~“) in the subsoil layer to the amount of new carbon in the entire top metre—that is, 
R30-100 = Cnew(30 to 100 cm)/Chew(O to 100 cm). See Extended Data Table 1 for further definitions. Note the dependence on time: the maximum value of the coefficient at the 95% confidence level 
(estimate + 2 s.e.m.) is 0.001 yr 1 Residual standard error, 0.11 on 92 degrees of freedom; multiple R?, 0.2387; adjusted R2, 0.1891; F-statistic, 4.808 on 6 and 92 degrees of freedom; P-value, 
0.0002618. 
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Extended Data Table 5 | Median depth of new carbon: multiple linear regression 


Coefficient Standard Tvalue 


estimate error 
Intercept = Forest 11.3 2.183 5.174 
Grassland 3.93 1.463 2.689 
Cropland 8.77 1.28 6.842 
Time (yr) 0.0201 0.0137 1.466 
MAT (°C) 0.183 0.089 2.045 
P/PET 5.11 1.32 -3.871 


Pr(>| 71) 


1.31¢e-06 
0.00849 
8.18e-10 
0.146 
0.0437 


0.000202 


Kk 


eeK 
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The dependent variable is the median depth (in cm) of the amount of new carbon (carbon derived from the vegetation after time t, in kg m~°) of each profile. See Extended Data Table 1 for further 
definitions. Residual standard error, 4.963 on 93 degrees of freedom; multiple R2, 0.3985; adjusted R2, 0.3662; F-statistic, 12.32 on 5 and 93 degrees of freedom; P-value: 3.551 x 10-9. 
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Extended Data Table 6 | Depth distribution of carbon transferred from atmosphere to SOM in 1965-2015 


Observed Observed Observed Global 
Forests Grasslands Croplands estimate 

0-10 cm 0.538 0.511 0.321 0.521 
(0.048) (0.071) (0.032) (0.046) 

10-20 cm 0.174 0.188 0.237 0.189 
(0.026) (0.021) (0.019) (0.020) 

20-30 cm 0.084 0.092 0.159 0.098 
(0.016) (0.020) (0.017) (0.017) 

30-40 cm 0.052 0.056 0.091 0.056 
(0.012) (0.012) (0.013) (0.009) 

40-50 cm 0.037 0.045 0.065 0.040 
(0.007) (0.011) (0.010) 0.008) 

50-60 cm 0.032 0.028 0.037 0.030 
(0.007) (0.010) (0.007) 0.008) 

60-70 cm 0.024 0.022 0.026 0.022 
(0.006) (0.010) (0.005) 0.006) 

70-80 cm 0.023 0.022 0.023 0.021 
(0.007) (0.009) (0.005) (0.006) 

80-90 cm 0.014 0.020 0.019 0.013 
(0.006) (0.007) (0.004) (0.007) 

90-100 cm 0.013 0.018 0.017 0.011 
(0.005) (0.007) (0.004) (0.008) 


These data were used to generate Fig. 3. The amount of new carbon transferred from the atmosphere to SOM in 1965 to 2015 (<50 yr carbon) in each 10-cm layer is expressed as a proportion of the 
total new carbon <50 yr in the first metre. The data shown are mean values for observed forests, grasslands and croplands, and global estimates; the numbers in parentheses are the 95% confidence 
intervals on the mean or estimate. 
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The approximately 10,000-year-long Last Glacial Maximum, before 
the termination of the last ice age, was the coldest period in Earth’s 
recent climate history’. Relative to the Holocene epoch, atmospheric 
carbon dioxide was about 100 parts per million lower and tropical sea 
surface temperatures were about 3 to 5 degrees Celsius lower”. The 
Last Glacial Maximum began when global mean sea level (GMSL) 
abruptly dropped by about 40 metres around 31,000 years ago* and 
was followed by about 10,000 years of rapid deglaciation into the 
Holocene!. The masses of the melting polar ice sheets and the change 
in ocean volume, and hence in GMSL, are primary constraints for 
climate models constructed to describe the transition between the 
Last Glacial Maximum and the Holocene, and future changes; but 
the rate, timing and magnitude of this transition remain uncertain. 
Here we show that sea level at the shelf edge of the Great Barrier Reef 
dropped by around 20 metres between 21,900 and 20,500 years ago, 
to —118 metres relative to the modern level. Our findings are based 
on recovered and radiometrically dated fossil corals and coralline 
algae assemblages, and represent relative sea level at the Great 
Barrier Reef, rather than GMSL. Subsequently, relative sea level rose 
at a rate of about 3.5 millimetres per year for around 4,000 years. 
The rise is consistent with the warming previously observed at 
19,000 years ago’, but we now show that it occurred just after the 
20-metre drop in relative sea level and the related increase in global 
ice volumes. The detailed structure of our record is robust because 
the Great Barrier Reef is remote from former ice sheets and tectonic 
activity. Relative sea level can be influenced by Earth's response to 
regional changes in ice and water loadings and may differ greatly 
from GMSL. Consequently, we used glacio-isostatic models to derive 
GMSL, and find that the Last Glacial Maximum culminated 20,500 
years ago in a GMSL low of about — 125 to — 130 metres. 

The sea level rise between the Last Glacial Maximum (LGM) and 
the Holocene was at times episodic and stimulated particular patterns 
in coral reef growth and evolution. Estimates of the maximum volume 
of excess ice and amount of water that contributed to the change in 
the corresponding GMSL was originally based on the stratigraphy of 
radiocarbon and U-series dated corals from Barbados®’. Data from 
radiocarbon-dated microfossils and macrofossils also helped to define 
LGM palaeo-shorelines from the Sunda Shelf in the South China 
Sea® and the Bonaparte Gulf of northern Australia®?. However, large 
uncertainties remain in local relative sea level (RSL) data and in glacio- 
isostatic (GIA) model inputs, such as ice histories and Earth rheology. 
These are needed to estimate the bounds of past and future GMSLs, 
which, for the LGM, range*!° from 115m to 135m. 

Precise and accurate sea level histories, often derived from dating 
of fossil corals and algae, are important. Coastal shelf geometry and 


location of land-bridges and islands at the LGM have a bearing on 
probable human migration routes and affect the ecological and spe- 
cies diversity, for example, of endemic flowering plants on islands that 
had complex, dynamic histories and covered a larger area during the 
LGM1"1. Sea levels vary with polar ice volumes and the extent of LGM 
ice-sheets can affect atmospheric pressure patterns and alter the salinity 
of oceans causing circulation changes’. 

Fossil coral and coralline algae deposits of the LGM-Holocene 
period at the Great Barrier Reef (GBR) are below the present sea 
level and were drilled during the Integrated Ocean Drilling Program 
(IODP) Expedition 325 in 2010” (Fig. 1). Corals and coralline algae 
were recovered from 34 holes, over two transects 500 km apart, at 
Hydrographer’s Passage (HYD-01C) and Noggin Pass (NOG-01B; 
Fig. 1). Depths of up to 150 m below sea level were reached to access 
the full LGM period”. Selected well characterized samples were dated 
using U-series (coral) and accelerator-based radiocarbon (coral and 
algae) methods (Extended Data Table 1). Sea level depth uncertainties 
depend on the palaeo-habitat depth range of particular coral and algae 
species and were conservatively assessed in conjunction with associated 
algal crust thickness, vermetid gastropods and by benthic foraminiferal 
assemblages'*' (Figs. 2 and 3, Methods and Extended Data Figs. 1-6). 

A brief outline of previous determinations of the timing and duration 
of the LGM shows that it extended from about 29,500 years (29.5 kyr) 
ago to 19 kyr ago’. There is an initial rapid (>40 m) fall in GMSL from 
31-32 kyr ago to 29-30 kyr ago (Fig. 4a, b)!*. A protracted gradual 
GMSL drop was construed at about 29-21 kyr ago*. However, this 
was largely an extrapolation between the two endpoints owing to the 
sparsity of data which also have large (about 20 m) uncertainties in RSL 
elevations (Fig. 4c, e)*"”. The onset of deglaciation is apparent from 21 kyr 
ago with a gradual 10-15 m GMSL rise* followed bya short stable, or 
possibly slowly falling, GMSL from around 18 kyr ago to 16.5 kyr ago 
(see figure 4a, b of ref. *). From then on, the deglaciation proceeded at a 
fast pace, at times exceeding 12 m kyr! during the so-called meltwater 
pulses!*}5, 

We converted our new GBR local sea levels to global values through 
GIA modelling (Methods), which accounts for the higher GBR coastline 
elevations owing to increased ice volumes and reduced adjacent ocean 
water loading. Our new GBR local sea levels, converted to global values 
through GIA modelling (Methods), substantially diverge from earlier 
determinations and completely revise the internal structure of the LGM 
GMSL (Fig. 4b). A large number of data points from the Noggin Pass 
and Hydrographer’s Passage sections show a relatively constant sea level 
from about 28 kyr ago to 22 kyr ago after the initial rapid fall from 
31 kyr ago4, as documented previously from Barbados, Huon and the 
Bonaparte Gulf (Figs. 3 and 4c—h). The GMSL remains invariant from 
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Fig. 1 | Location of GBR Expedition 325 study site. Maps show the 
northern and southern sites at Cairns (Noggin Pass, NOG-01B) and at 
Mackay (Hydrographer’s Passage, HYD-01C). The high-resolution three- 


30 kyr ago to 21.5 kyr ago at a mean value of 113 m within a range 
of about 6 m and represents the early LGM period, LGM-a (Fig. 4b; 
red band). This is much shallower than former estimates? (Fig. 4b). 
The previously identified'*’” ‘onset of deglaciation about 19 kyr ago in 
fact corresponds to a rapid sea level fall followed by an approximately 
4-kyr-long and 3.5 m kyr“! sea level rise. We label this period LGM-b, 
lasting from about 21 kyr ago to 17 kyr ago. The minimum GMSL at 
LGM-b ranges from about 130 m to 125 m owing to analytical uncer- 
tainties and uncertainties in Earth model parameters and occurs after 
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dimensional multibeam images show the surface geomorphic context’, 
drill transects and specific locations of the drill holes. 


an approximately 20 m drop (Methods, Extended Data Figs. 8 and 9 
and Extended Data Table 1). This point defines the start of LGM-b, after 
which the GMSL slowly rises to 120 m about 17 kyr ago followed by a 
rapid transition towards full deglaciation. The rates of net ice mass gain 
or sea level fall at the inception of both the LGM-a (about 30 kyr ago) 
and LGM-b (about 21 kyr ago) are similar, in the approximate range 
15-20 mm yr7!, as determined from the slopes of the GMSL curves. 
This GIA-modelled, very fast, maximum glaciation rate is notably 
faster than the mean LGM-Holocene deglaciation rate* of around 
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Fig. 2 | LGM sea level drop at LGM-b. Here, the sea level drop is apparent 
as a distinct age discontinuity observed in cores M0055A and M0031-33A 
from the NOG and HYD transects, respectively, separated by more than 
500 km in the GBR. Cores obtained from different fossil GBR terraces and 
reefs reveal the trajectory of past sea level changes. Five reef sequences 

are distinguished on the basis of the IODP Expedition 325 record: Reef 1 
(>30 kyr old), Reef 2 (27-22 kyr old), Reef 3 (3a, 21-17 kyr old; 3b, 17-13 
kyr old), Reef 4 (13-10 kyr old) and Reef 5 (the modern GBR). The major 
growth hiatus evident from both the HYD and the NOG sections 

marks the death of Reef 2 (22.1-21.9 kyr ago) following the sea level fall 
leading to LGM-b and re-establishment of reef (Reef 3a) further seaward 
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20.7-20.5 kyr ago (Extended Data Figs. 3-5). The age versus depth 
relationships, and the presence of freshwater low-magnesium calcite 
cement, in Reef 2, confirms subaerial exposure during the LGM-b period. 
In contrast, their absence below Reef 3a deposits places a maximum limit 
on the drop in sea level (Methods). At the end of the LGM, 17 kyr ago, 

the sea level rose rapidly, flooding the outer shelf and causing the 
re-establishment of the reef over its former position (Reef 3b), marking the 
end of the hiatus at the top of Reef 2 about 17 kyr ago. The details of the 
transition to LGM-b and the critical samples defining the fall in sea level 
are shown in Extended Data Fig. 6. 
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Fig. 3 | Age versus depth plots showing the RSL envelopes derived from 
samples recovered by IODP Expedition 325. a, b, Both HYD (a, blue) 
and NOG (b, red) transect cores were examined for coral, coralgal algae 
and benthic foraminiferal assemblages (Extended Data Figs. 1 and 2) with 
the help of X-ray computed tomography scanning and X-ray diffraction 
(Supplementary Information), which revealed detailed features of RSL 
histories during the past 35 kyr. The >500 dates, selected through detailed 
sedimentologic and biologic analyses (Methods), provided a robust 
chronostratigraphic framework that defined five distinct reef sequences 
(Reefs 2 to 4 are labelled in Fig. 2), which grew episodically over the past 
30 kyr. The RSLs constructed here depend on the age (horizontal grey 
lines) and the sea level depth uncertainties (bars are ranges) related to 


12 mm yr. The sea level drop at LGM-b is strikingly evident in cores 
from both HYD and NOG sections as a major growth hiatus in cores 
M0031-33A and M0055A-33 (Fig. 2). Deposits of freshwater calcite 
cement, in corals below the hiatus, indicate subaerial exposure and 
low sea levels (Methods; Extended Data Fig. 5). In turn, the gap corre- 
sponding to the low in sea level is picked up at five other sites, M0035, 
M0036, M0039, M0053 and M0054, at lower elevation, to complete the 
sea level curve (Extended Data Fig. 3). 

We have established an ice model, based on the new GBR RSL 
data, that provides good agreement with other far-field RSL records 
when combined with a range of Earth model parameters that span the 
possible range of viscosities for the upper and lower mantle as well as 
lithospheric thickness. Model predictions of RSLs over the LGM-a and 
LGM-b periods generally agree well with the trends derived from exist- 
ing data (Fig. 4, Extended Data Fig. 7; Supplementary Information). 
The magnitude of the sea level drop at the start of the LGM" (about 
29 kyr ago) is about 40 m, constrained by coral data from the Huon 
Peninsula!’ and Barbados’, and by foraminifera oxygen isotope 
records from the Red Sea” within relatively large (about 10 m) uncer- 
tainties. There is also close agreement between model results and 
data over the deglaciation period from 17 kyr ago onwards except at 


26 28 30 32 34 
e HYD-01C U/Th corals 
o HYD-01C '4C-AMS corals 
® HYD-01C '4C-AMS coralline algae 
e NOG-01B U/Th corals 
o NOG-01B '4C-AMS corals 
® NOG-01B '4C-AMS coralline algae 

ee 
1 1 1 1 1 
26 28 30 32 34 


palaeo-habitat depth range (grey upward lines) of each dated coral or 
algal sample and the maximum coring depth uncertainty (grey downward 
lines). The palaeowater depth ranges for shallow water species (blue and 
red shading) were conservatively estimated using a multiproxy approach 
combining coral, coralline algae and other key indicators such as algal 
crust thickness, and the presence of vermetid gastropods (Methods, 
Extended Data Table 1). We note that the disconnected palaeowater depth 
lines on some samples are indicative of deeper habitat ranges, probably 
>20 m water depth. The distribution of marine and freshwater cements 
in the cores were used to support the new estimates of the timing and 
magnitude of the LGM-b sea levels (Extended Data Figs. 3-5), including 
hiatuses and regressions. 


Barbados (Fig. 4c). Here, the systematic overshoot of the data above the 
sea level curve is indicative of additional processes, possibly tectonic in 
nature’. This is consistent with independent evaluations including a 
study’ that uses the rate of change of degree-two harmonics of Earth’s 
geopotential due to GIA. 

The very rapid build-up of global ice volume during the two peri- 
ods of transition at LGM-a (about 29-31 kyr ago; ref. 4, Fig. 4d) and 
LGM-b (about 22-21 kyr ago; Fig. 4b) requires substantial moisture 
transport and snow precipitation over existing ice sheets. To accommo- 
date LGM-b an additional equivalent ice volume corresponding to up 
to 17 m of sea level is required. The location of the extra ice cannot be 
determined with certainty. GIA modelling and Northern and Southern 
Hemisphere bipolar climate paced by complementary high-latitude 
insolation highs'*!* at these times shows increased ice volume over 
the North American Ice Sheet at the LGM, whereas the Eurasian ice 
sheet appears to have grown at a slower pace and commenced melting 
after about 22 kyr ago (Extended Data Fig. 8). Colder Antarctic climate 
during the LGM"® is likely to have hindered ice calving and lessened 
basal melting of ice shelves, resulting in increased ice volume. The sus- 
tained growth of the Antarctic Ice Sheet during the LGM-b period and 
beyond, including continuing ice accumulation up to around 14 kyr 
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Fig. 4| GMSL and GIA-model-derived far-field RSL with observational 
data. a—h, The previously constructed GMSL curve over the past 140 kyr 
(ref. 4; blue line in a and b), is shown together with GIA model predictions, 
in c-h, for selected far-field data represented by red circles with depth 
and age error bars (standard deviation). High-latitude (65°) summer 
insolation curves for the Northern and Southern Hemispheres in b are 
shown as dashed orange and blue lines, respectively. Long-term sea 
level variation is in step with Northern Hemisphere summer insolation, 
whereas LGM-b occurs at the peak of Southern Hemisphere insolation. 
Orange circles in c-h indicate previously reported RSL with depth and 
age uncertainties (horizontal and vertical black lines). Results of 62 GIA 


ago, agrees with observations of the late retreat of the West Antarctic 
Ice Sheet at this time!?**. However, the major increase in ice volume, 
precipitating the onset of LGM-b, appears to have been during a short 
period (20 to 21 kyr ago) over the North American Ice Sheet, after 
which the North American Ice Sheet retreated from about 20 kyr ago 
onwards”> (Extended Data Fig. 8). 

The enlarged global ice volume about 30 kyr ago , equivalent to 
about 40 m of sea-level drop, persisted for over 8 kyr. Similarly, the low 
Northern Hemisphere insolation and somewhat reduced atmospheric 
carbon dioxide levels around 21-22 kyr ago led to a period of cold cli- 
mate, very low tropical Atlantic sea surface temperatures and ultimately 
the transition to LGM-b. At this time, increased Southern Hemisphere 
insolation is likely to have facilitated moisture transport to the South, 
increasing the Antarctic Ice Sheet volume”!*””. The Heinrich event 1 
about 17 kyr ago marks the end of LGM-b when full deglaciation kicked 
in as the pace of Northern Hemisphere insolation and atmospheric 
carbon dioxide levels increased rapidly (Fig. 4). 

The two sharp transitions preceding the LGM-a and LGM -b periods, 
associated with rapid accumulation of ice and lower sea levels, at 
the end of the last ice age, do not appear to be explicable in terms of 
processes attributable to any specific climate-change dynamic. During 
this time (around 29-19 kyr ago), oxygen isotope records in ice cores do 
not show a clear, distinct signal; the carbon dioxide levels were stable, 
insolation at the time was not very different to the present and tropical 
sea surface temperature did not change notably’. A systematic 
behaviour in sea level and climate has previously been noted”? 
whereby transitions between two states, cold to warm or warm to 
cold, took place via a third state that was more extreme, as in the 
MIS3-LGM- Holocene transition (where MIS3 indicates the last ice 
age at marine isotope stage 3) or at the end of the last interglacial, a 
final high-stand before the MIS 5d transition. A similar behaviour in 
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model runs (Methods), over a range of potential Earth model parameters, 

such as lithospheric thickness and the viscosities of both the upper and 

lower mantle, are within the orange shading in a and b and grey shading 

in c-h. The red and blue curves in the grey shading in c-h show the results 

for lower-mantle viscosities of 10°” Pa s and 10” Pa s, respectively, with 

a lithospheric thickness of 70 km and upper-mantle viscosity of 10°° Pas. 

We note the foraminiferal oxygen-isotope-based Red Sea data”® (h), 

which supports the new GMSL calculations from the present study within 

uncertainties (about 10 m). The orange shading in h represents confidence 

intervals of 95% for the RSL data (light orange) and the probability 

maximum (dark orange) reported in ref. 7°. 


climate was previously noted where the appearance of an interme- 
diate, extreme third state may have resulted in a shift from “41-kyr’ 
cycles to ‘100-kyr’ cycles about 800-1,000 kyr ago*’. These bifurca- 
tions can be thought of as distinct energy states of climate between 
which ‘stochastic perturbations’ can cause transitions”’. It appears that 
the same behaviour may occur over short timescales, not only over 
100-kyr cycles. The transitions are not only manifest in climate but are 
also associated with sea level change. 


Online content 

Any Methods, including any statements of data availability and Nature Research 
reporting summaries, along with any additional references and Source Data files, 
are available in the online version of the paper at https://doi.org/10.1038/s41586- 
018-0335-4. 
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METHODS 

IODP Expedition 325: the GBR environmental changes. The IODP Expedition 
325 was designed to complement the previous IODP “Tahiti Sea Level’ Expedition 
310 to Tahiti*". In preparation for Expedition 325, potential drill sites were sur- 
veyed with the CSIRO ship RV Southern Surveyor using multibeam sonar, seismics, 
an autonomous underwater vehicle and rock dredging!***. At most of the GBR 
locations, the shelf breaks at approximately 120 m and is populated with prominent 
terrace-like structures; other relict reefs appear successively at depths*** of 100-90 m, 
60-50 m and 35-40 m. 

The mission-specific platform chosen for Expedition 325 was the Greatship 
Maya, an IMO class II vessel capable of being positioned dynamically for geo- 
technical coring**>. The expedition took place between 12 February and 6 April 
2010. A total of 34 holes across 17 sites were sampled, ranging in depth from 46.4 m 
to 170.3 m such that the recovered coralgal deposits span several crucial but 
poorly defined periods during the LGM and last deglaciation. Sampling occurred 
at three locations along the northeastern coast of Australia, but this study focuses 
on one transect at 19.7° latitude offshore of Cairns at Noggin Pass (NOG-01B), 
and another at 17.1° latitude, offshore of Mackay at Hydrographer’s Passage (Fig. 1, 
HYD-01C). Photographs of half-sectioned reef cores, relevant to the present study, 
are shown in Extended Data Figs. 1 and 2 together with the depths and genera 
of the dated corals and coralline algae. Classification of coral genera and species 
in terms of their habitat preferences according to depth and turbulence levels is 
also supported by considering the habitats of associated coralline algae taxa and 
crust thickness and vermetid gastropods. For example, individual coralline algae 
can have a limited range of habitats—bounded by sensitivity to light levels, wave 
energy and other factors—and can be used to more accurately constrain the palaeo- 
depth ranges. Using this careful and detailed multi-proxy approach, each dated 
coral and coralline algae sample was placed within an internally consistent coralgal 
assemblage and palaeobathymetric scheme*’, enabling the construction of an 
independent RSL envelope at each site (Extended Data Figs. 1-3, Extended Data 
Table 1 and Supplementary Information). 

Main lithologic facies observed in the Expedition 325 cores. The main lithologies 
are divided into coral reef framework and detrital sedimentary facies. The three 
boundstone facies are defined by their varying proportions of corals, coralline 
algae and microbial deposits forming coralgal, coralgal-microbialite and micro- 
bialite-dominated boundstones. The detrital facies can occur locally as internal 
sediments within the boundstones, or as metre-scale intervals of packstones to 
rudstones and unconsolidated sediments. Details of facies and depth estimates 
using facies as well as coral and coralline algae assembly can be found in Extended 
Data Figs. 1 and 2 and Extended Data Table 1, and are derived from ref. T2 

Reef 2 hiatus and GMSL drop to LGM-b. A major growth hiatus is observed 
in the inner shelf terrace at 104-106 m below sea level, at both HYD-01C (Hole 
M0031-33A) and NOG-01B (Hole M0055A) (Extended Data Figs. 3 and 4)'?. This 
represents the turn-off (death) of Reef 2 about 21 kyr ago and is interpreted to be 
caused by the drop in sea-level to the LGM-b. The coralgal assemblages show that 
palaeowater depths were shallow (<10 m) just before the death of Reef 2, and lith- 
ologic and seismic evidence indicates that Reef 2 was a major subaerial exposure 
surface!”, Furthermore, detailed scanning electron microscopic, energy-dispersive 
X-ray spectrum, X-ray diffraction analyses and thin-section observations of Reef 2 
deposits confirm that they were exposed to freshwater or subaerial environments 
(for example, low-magnesium calcite cements in Hole M0055A Core 4R1) during 
the sea level lowstand at LGM-b (Extended Data Fig. 5). At this time shallow reef 
development migrated about 1-0.25 km seawards (that is, Reef 3a) in <2 kyr, as 
the RSL sea level fell to 118 m below present by about 20.5 kyr ago. The Reef 3a 
deposits and the older-than-MIS3 deposits are characterized by wholly marine 
diagenetic features consistent with the interpretation that the LGM-b sea level did 
not fall below this level (Extended Data Fig. 6). Sea level rose during the deglacial, 
causing major Reef 3a aggradation” before re-flooding the inner shelf terraces 
about 16.5 kyr ago and causing the re-establishment of the reef (Reef 3b) over its 
former position, marking the end of the hiatus at the top of Reef 2. 

Determining the ages of sea level indicators. More than 165 representative coral 
skeletons and their aragonite content were analysed by powder X-ray diffraction, 
X-radiography, scanning electron spectroscopic and petrologic investigations, all 
of which confirmed the pristine nature of the dated samples”. In a few cases, X-ray 
diffraction picked up minor signatures of high-magnesium calcite, probably due to 
trace amounts of coralline algae and microbialite sediment. Yet no notable calcite 
peaks were observed in most cases. Physical cleaning of branched corals for U-Th 
dating and severe acid dissolution (more than 50% of their weight) of samples for 
radiocarbon dating was used to remove potential secondary precipitated mate- 
rials. For massive Porites corals, physical cleaning is difficult, requiring further 
geochemical tests including inductively coupled plasma mass spectrometry anal- 
yses (ICP-MS). Skeletal Mg/Ca ratios confirmed the absence of large amounts of 
high-magnesium calcite and secondary aragonite cements. Even when second- 
ary aragonite was found, the ages are not much affected because the form of the 


cements is indicative of the early post-mortem phase of corals. Further evalua- 
tions included limits on total uranium, 2*2Th content and initial 2>4U/?3°U ratio. 
We applied different initial °“U/?**U criteria: for samples of the deglacial period 
between 17 kyr ago and the present, the acceptable range was 1.1452 + 0.0140, 
whereas for the samples from 30 kyr ago to 17 kyr ago 1.1402 + 0.0140 was used. 
All data used to reconstruct the RSL envelopes for each transect are shown in 
Supplementary Information (see Methods section ‘RSL reconstruction’ for more 
details) and the primary samples used to determine the specific RSL inflection 
points are indicated as ‘HY-1, 2’ and ‘NO-1, 2’ for HYD-01C and NOG-01B, 
respectively, in Supplementary Information and highlighted in bold. U-series and 
radiocarbon ages from the same coral samples also showed remarkable consistency, 
along with radiocarbon ages on directly adjacent coralline algae. Taken together, 
and combined with the consistent reproducibility of the RSL envelopes between 
two transects, more than 500 km apart, this confirms the veracity of the data. 
Radiocarbon dating. More than 500 radiocarbon dates were obtained using cor- 
als and coralline algae samples, which were all processed at the Atmosphere and 
Ocean Research Institute at the University of Tokyo to convert them into graphite**. 
Typically, 1 mg or more of graphite was measured using a Single Stage Accelerator 
Mass Spectrometer at the Atmosphere and Ocean Research Institute* and at the 
Australian National University (ANU). The results were then converted to cal- 
endar ages with local reservoir ages (12 + 10 yr) from ref. “!, which we obtained by 
averaging between Heron Island (8 +6 yr) and Abraham Reef values (15 +6 yr). 
The calibration was then performed using international calibration datasets (IntCal 
13 and Marine 13)*. 

Analytical procedures, mass spectrometry and U-Th dating. The analytical data 
are listed in Supplementary Information. Consistency between laboratories for 
replicate measurements of a single specimen is within 100 years, which is similar 
to the intra-coral variability observed in some specimens measured using the high- 
precision (Woods Hole Oceanographic Institute) method. Uranium series dating 
were conducted at three different laboratories: the Australian National University 
(ANU), the University of Oxford and the Woods Hole Oceanographic Institute. 
A 61-cm mass spectrometer is used at ANU, which can operate in charge mode*’. 
The Th, °Th and 7” Th isotopes were measured simultaneously in charge mode 
in Faraday cups using 20-pF feedback capacitors as active electrometer elements. 
The 777, 74U and ?°°U isotopes were also measured in charge mode, whereas 
?38U was simultaneously measured using a 10'°-C feedback resistor. The magni- 
tude of the *U low-mass tail was monitored continuously at mass ”*’U in charge 
mode. This was used to subtract the 77°U tail from under the 7*7U, 734U and 75U 
isotopes. Extensive measurements with an unspiked U-standard HU-1 showed that 
the shape of the °U tail remained invariant under a wide range of conditions, in 
particular, at the expected locations of the 77U, 734U, 739U and 7°°U peaks. Sample 
loads, on single rhenium filaments, ranged from 0.5 jig to 0.8 jg and the “*U beam 
intensity was kept between 8 x 101! A to 10 x 101! A for several hours. At these 
intensities, the 10!°-C feedback resistor was used to avoid response-time problems 
encountered with the considerably slower 10'!-Q resistors. The instrument was 
calibrated with reference to a secular-equilibrium standard HU-1.Comparisons 
with Western Australian last interglacial samples“ and with Hulu Cave speleothem 
data*> showed precise agreement with previous measurements. Sample processing 
followed previously established procedures**. U and Th were separated from the 
coral carbonate using U-Teva resin in a single pass. 

U-Th dating at the University of Oxford and the Woods Hole Oceanographic 
Institute were measured by multi-collectors with ICP-MS. At the University of 
Oxford, U and Th isotopes were measured with ion counter collectors for the 
minor isotope beams. Approximately 0.3 g of coral sample was dissolved and spiked 
with a mixed “°U:?”Th tracer. U and Th were purified and measured separately: 
U isotopes, statically; and Th by peak-hopping the 229 and 230 beams into an ion 
counter while normalizing beam intensity between the steps with either ?°°Th or 
235 measured in Faraday collectors. Instrumental biases and relative collector 
efficiencies are accounted for using standard sample bracketing using U and Th 
isotope standards*’. 

At the Woods Hole Oceanographic Institute, U and Th isotopes were measured 
by multi-collector ICP-MS in static mode with all isotopes in Faraday collectors*®. 
Large (about 5 g) subsamples of coral were dissolved and spiked with a mixed 
2331 736U.2°Th tracer, optimized for the LGM-to-deglacial age samples, and 
co-precipitated with Fe. To determine the *°Th/*°U, purified U and Th fractions 
are recombined such that U and Th are measured together at isotope ratios that can 
be closely matched to bracketing standards. The **4U/™°U is similarly determined 
statistically in Faraday collectors but on an unspiked aliquot. 

All activity ratios and ages are calculated using the half-lives reported in 
ref. >, Ages are presented in Extended Data Table 1 as ‘raw; assuming all °°Th is 
accumulated in the coral since growth, and age-corrected for detrital 7°°Th. The 
detrital correction makes use of the measured 7°*Th/**8U as a proxy for the amount 
of detrital contamination, an assumed detrital composition of crustal origin”, and 
an allowance for non-secular equilibrium of the contaminant. 
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RSL reconstruction. The sample context was assessed using established 
criteria!” including: (1) core quality, (2) orientation of well preserved corallites; 
(3) thick coralline algal crusts capping upper coral surfaces; (4) evidence of sub- 
strate attachment; and (5) the presence or absence and orientation of geopetals in 
lithified facies. Using these criteria, all the samples were classified into the following 
four context categories: (1) IS =in situ (convincing supporting evidence), 
(2) IS?=likely in situ (inclusive supporting evidence), (3) ISX=not in situ 
(convincing non-supporting evidence, and (4) ISN = status not known (inadequate 
evidence either way). Samples from highly drill-disturbed or poor recovery 
intervals were excluded. A total of 540 samples satisfying these criteria were used to 
construct a RSL envelope (upper and lower bounds) at both sites. Despite known 
temporal differences in ocean reservoir age, the coral U-Th and coral/coralline MC 
ccelerator mass spectrometry ages are remarkably consistent. However, wherever 
possible we used the more precise U-Th coral ages to constrain the upper and 
lower bounds of the envelopes. This was achieved by visually fitting a line through 
the dates that were >1 m apart and outside their analytical age errors, while also 
taking into account the upper bound of the palaeowater depth estimate of each 
sample and any core recovery uncertainties (Supplementary Information). Where 
multiple coral dates overlapped (in time) we used the mid-point between samples. If 
replicate age determinations were available for the same sample (that is, same 
coral colony or coralline algal crust) (Supplementary Information) an average 
age was calculated and plotted in Fig. 3. The upper bound or minimum position 
of the sea level envelope was further constrained by considering the overlapping 
palaeowater depth ranges of both the shallow-water sea level indicators and their 
coeval, deeper fore-reef slope equivalents. Finally, the major inflection points 
marking clear changes in the direction, amplitude and rate of RSL change were also 
identified for each envelope (Fig. 3). Thus the lower bound (that is, maximum 
sea level position) of RSL curves and the specific samples defining them are 
indicated in bold in Supplementary Information (that is, HY-1, 2 and NO-1, 2) 
and a close-up of the key samples constraining LGM-b is also shown in Extended 
Data Fig. 6. 

GIA model predictions. The GIA model calculations included an Earth model 
describing the viscoelastic properties of the solid Earth, as well as an ice com- 
ponent documenting the ice melting history, reconstructed mainly from far- 
field sea level observations**”. The Earth model is based on the seismologically 
derived Preliminary Reference Earth Model (PREM)! and consisted of an elastic 
lithosphere with an upper- and lower-mantle divide at 670-km depth. The litho- 
sphere thickness was 70 km and upper- and lower-mantle viscosities ranged from 
(1-10) x 107° Pa s and (1-100) x 10” Pa s respectively. This model provides an 
accurate treatment of time-dependent continental shorelines and the Earth rota- 
tion feedback on sea level*?. 

The ice model described above was adjusted to match the newly obtained 
(RSL) records from the GBR. The analytical uncertainties in RSL were taken into 
account and the shallow and deep extremes of the RSL envelope were tested. We 
first employed the ice history model developed by the ANU group’ and ran the 
GIA model to obtain RSL histories for the GBR. The ANU ice model with the 
same relative ice volumes was then scaled to fit the NOG and HYD RSL. The 
scaling was done manually within a reasonable range of various parameters and 
by keeping the relative ice volume of various ice sheets the same as in the original 
ANU model, though keeping the Eurasian ice model almost the same as the ANU 
model since the history of this ice sheet is reasonably well constrained from both 
observations and models?4~*° compared to other ice sheets. The chosen Earth 
parameters (lithospheric thickness 70 km, upper-mantle viscosity 2 x 107° Pas, 
and lower-mantle viscosity 107” Pa s)*”°* fit the GBR region Holocene sea levels 
well. The analytical uncertainties in RSL and the range in Earth model parameters 
(approximately +2.5 m contribution to GMSL) were used in calculating the MAX 
and MIN extremes (Extended Data Figs. 8 and 9). Supplementary Information 
shows GMSL contributions individually for each major ice sheet and for the ANU 
and highest and lowest sea level GMSL scenarios (in eustatic terms). The two 
GMSL curves were then used to construct RSLs in far-field sites with previously 
published RSL data for comparison (Fig. 4 and Extended Data Fig. 7). Potential 
Earth model uncertainties were also considered with variable lithosphere thickness 
and a range of viscosities, of the lower as well as the upper mantle, resulting in more 
than 60 GIA model experiments. Shaded areas of curves represent possible ranges 
in RSL for individual sites (Fig. 4 and Extended Data Fig. 7). Visual inspection of 
the results indicates that the MAX model fits the data remarkably well for almost 
all the far-field locations tested. In turn, this indicates that the shallow coral habitat 
depth estimates appear to be sufficiently robust without necessitating extended, 
deeper-water limits. 

During the LGM (30-19 kyr ago; Fig. 3), water depth uncertainties for most 
samples from the HYD and NOG transects are <5 m. Figure 2 shows RSL curves 
derived from MAX and MIN models. Various Earth rheology parameters were 
also tested using both MAX and MIN ice models where the shaded region around 
GMSLs in Extended Data Fig. 9 represents the corresponding range in RSLs. 
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During the LGM, the maximum magnitude of RSL difference” between the two 
transects is less than 10 m and hence RSL variations arising from hydro-isostasy are 
small. The models were run over the maximum possible ranges of the rheological 
parameters so that the range of RSLs depicted in the shaded zone in Extended Data 
Fig. 9 cover the full range of possibilities. Traditionally, lower-mantle viscosity has 
been estimated as about 107” Pa s using far-field RSL observations®’”*, whereas 
recent studies have reported much higher values‘ of about 7 x 107” Pas. Thus, 
assuming a typical lithospheric thickness and upper-mantle viscosity, respectively, 
of 70 km and 2 x 10?° Pa s, the maximum RSL differences associated with the 
above range of lower-mantle viscosities is about 5 m (Extended Data Fig. 9). This 
number is smaller than the typical uncertainties inherent in RSL observations at 
the GBR and is therefore well suited for reconstructions of GMSLs during LGM-a 
and LGM-b. In summary, we concluded that the MAX model provides the best 
estimate of GMSLs as well as indicating that these tighter depth uncertainties for 
GBR corals’ do provide consistent results. Therefore, the extended MIN to MAX 
range, employed here, covers the likely range of GMSL constructions with confi- 
dence, as can be ascertained by visual inspection (Extended Data Fig. 9). 

Glaciological evidence, including from ice cores, cannot easily accommodate 
the required increase in ice volume. However, the total increase is shared among 
the large ice sheets (Extended Data Fig. 8). Ice cores retrieved from Antarctica and 
Greenland are not able to resolve the required magnitude of elevation changes in 
continental interiors. It is also likely that current ice-free regions may have been 
places that retained the extra ice at these times. For example, recent evidence sug- 
gests that an extensive ice sheet was grounded on the Ross Sea for at least 3,700 
years’. New bathymetric data as well as glacial models support these conclusions®!. 
However, there is still scope to improve the glaciological models and we hope that 
our data will contribute to this effort. 

Discrepant GMSLs during the LGM at either —120 m or —140 m have been 
reported respectively for Barbados’ and the Bonaparte Gulf in North Australia®. 
This has now been reconciled using the recently reported Earth rheology model 
with J2 observations’? as well as considering subducting material in Barbados”. 
The model included 65-100 km of lithospheric thickness and upper- and lower- 
mantle viscosities of (1-3) x 107° Pa s and 10” Pas. The global RSL observations 
could reasonably be explained if GMSL during the LGM were about —130 m. 
This finding is consistent with the model derived from more than 1,000 far-field 
RSL observations‘. The results from the present study, for both the MAX and the 
MIN options, are respectively —125 m and —130 m and thus consistent with the 
independent estimates described above. 

Data availability. All the data used in this manuscript are available in 
Supplementary Information. The modified ANU ice model is available upon 
reasonable request. 

Code availability. The model that we employed in this paper is available from J.O. 
(okuno@nipr.ac.jp) and Y.Y. upon reasonable request. 
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Extended Data Fig. 1 | Simplified classification of the main coralgal robust branching Isopora (1) and corymbose Acropora gr. humilis (2); cB, 
assemblages observed in the Expedition 325 cores. Shallow reef branching Seriatopora (3) and Acropora sp. (4); cC, massive/encrusting 
habitats are represented by coral assemblages when associated with thick meruliniids (5); cD, encrusting to massive Porites (6) and encrusting 
crusts of aA1 coralline algae and vermetid gastropods (8). Deep, fore- Montipora; aA3, Mesophyllum and Lithothamnion (10); aA1, Porolithon 
reef slope settings are defined by coral assemblages cD when associated onkodes (7); and aA2, thin P. onkodes, Porolithon gardineri, Harveylithon 
with thin crusts of aA3 and in the absence of aAl and aA2. cA, massive/ gr. munitum (9). 
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Extended Data Fig. 2 | Representative facies observed in the Expedition 325 cores. The main lithologies are divided into coral reef framework, (1)-(3), 
and detrital sedimentary facies, (4)-(8). 
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Extended Data Fig. 3 | Stratigraphic synthesis of inner terrace cores text for details). Following deglacial sea level rise, reef growth was re- 
at Mackay. The vertical patterns in coral recovery, lithologies, coralgal established as Reef 3b turned on when the shelf reflooded before this reef 
assemblages and vermetid gastropods are summarized here. These data drowned after about 14 kyr ago (ka). (‘bst., boundstone; ‘mb, microbialite; 
define a major hiatus (pink line) in reef development as the top of Reef 2 ‘unconsol., unconsolidated sediment; and ‘gastro, gastropod.) 


was exposed following the sea level fall to the LGM-b low-stand (see main 
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Extended Data Fig. 4 | Close-up core images showing the boundary lithologies, coralgal assemblages and diagenetic features, including fresh 
between Reef 2 and Reef 3a. Cores M00033A-10R and 11R (HYD-01C) water and meteoric cements (blue star) indicating that the top of Reef 2 has 
and M00055A-4R (NOG- 1B) clearly define the nature of the Reef 2/ been subaerially exposed. (LMC, low-magnesium calcite; RD, reef-death 
Reef 3b boundary, which represents a major hiatus in reef growth (see 
Extended Data Fig. 3). This boundary is characterized by major changes in 


events.) 
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Extended Data Fig. 5 | Evidence of subaerial exposure. The blocky low- Then peloids (p) of high-magnesium calcite formed under marine 
magnesium calcite meteoric cement that is related to subaerial exposure is conditions and filled the remaining voids when they were submerged 
initially precipitated in the intergranular voids of grainstone in 55A 5RI. following re-flooding. Scale bar, 100 jum. 
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Extended Data Fig. 6 | Timing and extent of the sea-level drop at LGM- 
b. Age versus depth plot showing the key in situ RSL data points from 
Hydrographer’s Passage (HYD-01C) (in blue) and Noggin Pass (NOG- 
01B) (in red). Accelerator mass spectrometer !C ages derived from corals 
are indicated by open circles and those derived from coralline algae are 
indicated by crossed circles. U-Th coral ages are indicated by filled circles. 
Inflection points defining the maximum position of the RSL at HYD-01C 
and NOG-01B are also shown (see labels NO-5, 8, 9, and HY-3, 5, 6 in 
Supplementary Information, corresponding respectively to data points 11, 
8, 1 and 9, 7, 5 on this figure; see also Fig. 3). The combined RSL envelope 
represented by the black lines (maximum and minimum positions) takes 
into account the uncertainties in the age (2c), palaeowater depth and 
position in the core of each data point, which is illustrated by a coloured 
rectangle (blue for Hydrographer’s Passage and red for Noggin Pass) 
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(see Methods for more details). If we were to omit the '4C data owing to 
possible unaccounted-for variability in local reservoir ages®’, the LGM-b 
sea level drop defined by coral U-Th ages would be 1.5 kyr earlier at 

23.5 ka, corresponding to an extended sea level drop over about 3 kyr 

at a rate of about 7 m kyr~!. Key RSL index points are as follows. 1, 
325-M0053A-13R-1W 21-25 (20.51 ka, 117.93 m; NO-9). 2, 325-M0054B- 
06R-1W 64-67 (20.50 ka, 124.39 m). 3, 325-M0054B-07R-1W 5-9 

(20.47 ka, 125.3 m). 4, 325-M0035A-18R-1W 10-15 (20.43 ka, 127.11 m). 
5, 325-M0036A-18R-2W 8-10 (20.70 ka, 128.75 m; HY-6). 6, 325-M0054B- 
O8R-2W 73-75 (22.13 ka, 128.54 m). 7, 325-M0033A-11R-CCW 5-11 
(22.11 ka, 106.83 m; HY-5). 8, 325-M0055A-04R-1W 35-40b (21.87 ka, 
103.13 m; NO-8). 9, 325-M0032A-10R-1W 18-20 (23.49 ka, 107.95 m, 
HY-3). 10, 325-M0033A-13R-CCW 1-3 (23.62 ka, 109.46 m). 11, 
325-M0055A-04R-2W 99-105 (23.97 ka, 104.84 m; NO-5). 
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Extended Data Fig. 7 | GIA results for selected sites. Calculations, for modelling with various earth parameters (lithospheric thickness 

selected far-field sites (a-f), using the ice model based on the lower-bound ©H=70km, upper-mantle viscosity 107°-10*! Pa s, lower-mantle viscosity 
of RSL from GBR. GIA calculations implemented using parameters of 1071-10? Pa s), and a melting model (of ice history), which, in this case, 
upper-mantle viscosity and lithospheric thickness of 10°° Pas and 70 km was the MIN model. The red lines are for H=70km, upper-mantle 

and lower-mantle viscosity of 10?” Pa s. However, the differences between _ viscosity 2 x 10”° Pa s and lower-mantle viscosity 10” Pa s. The blue lines 


data and calculations for the far-field sites of Tahiti (b), Bonaparte (c) and show the case for H=70 km, upper-mantle viscosity 2 x 107° Pa s and 
Sunda (d) indicate a better match with GIA when the higher RSL obtained —_ lower-mantle viscosity 10” Pa s. Vertical uncertainties are depth ranges for 
from this study is used (Fig. 4) in calculating the global deglacial sea each sea level indicators and horizontal error bars are 2c uncertainties for 
levels. The grey band represents the range of RSL predictions using GIA age estimations reported from the literature. 
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Extended Data Fig. 8 | Comparisons between the ANU GMSL and the 
present GBR-based GMSL. a-d, Total (a), North America (b), Eurasia (c) 
and Antarctic (d) ice sheet growth and melting histories for the past 

35 kyr inferred from the present study (red bands) are compared with 
previously reported model (blue line). The blue curves represent the 
ANU GMSL results*, whereas the red bands, covering a range of GBR- 
RSL-based GMSL, are from this work. During the transition from LGM-a 
to LGM-b, around 21 kyr ago, there is enhanced precipitation over the 


North America 


0 10 20 30 


Time (kyr BP) 


North American Ice Sheet and to a lesser extent over Antarctica, although, 
for the latter, the ice volume continues to build up, for longer, until the 
termination of LGM-b approximately 17 kyr ago. The manually adjusted 
nominal ANU ice model will influence our inferred melting history for 
each ice sheet. Furthermore, although the ANU model differs in some 
respects from other ice sheet reconstructions™, the results from these 
simulations are not very different. 
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Extended Data Fig. 9 | GIA model results for HYD and NOG with 
different viscosity settings. MAX (a) and MIN (b) represent the 
maximum and minimum extremes of GBR RSL. Blue and red bands are 
RSL ranges derived from our study for Hydrographer’s Passage (HYD) and 
Noggin Pass (NOG). Grey bands represent the range of predicted sea levels 
using the new ice model. Lithospheric thickness is fixed at 70 km, whereas 
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10°10?! Pa s and 10*!-10?? Pa s. RSL predictions for representative 
viscosities for HYD-01C and NOG-01B are shown as red and blue solid 
and dotted lines respectively. Vup and Viow represent the upper- and 
lower-mantle viscosity values. (obs., observations.) 
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Extended Data Table 1 | Simplified coral and coralline algal assemblages and their likely palaeoenvironmental setting 


Assemblage 


Corals 


cA 


cB 


cc 


cD 


Coralline algae 


aAl 


aA2 


aA3 


Main components 


Massive/robust branching /sopora & medium 


corymbose branching Acropora humilis gr. 


Fine branching Seriafopora and Acropora sp. 


Massive & encrusting meruliniids 
(Dipsastrea, Cyphastrea, Platygyra) 
Encrusting to Massive Porifes & encrusting 


Montipora 


Thick crust of Porolithon onkodes 


Thin crusts (<0.2 mm) of P. onkodes. 
Potential occurrence ofPorolithon gardineri, 
Harveylithon gr. munitum 

Absence of aA1 & aA2. Occurrence 

of thin crusts of Mesophylium, Lithothamnion 


and Lithoporella 


*When associated with centimetres-thick aAl CAR crusts and vermetid gastropods. 
**When associated with thin crusts of aA3 and lacking vermetid gastropods. 
*«*Palaeowater depths were estimated by comparison with their modern Indo-Pacific counterparts. 


Paleoenvironmental setting 


Shallow, high energy/exposed 


reef edge 


Wide range of protected reef 
settings 

Wide range of protected to turbid 
reef settings 


Very wide range of reef settings 


Shallow reef settings 


Wide range of reef settings 


Deep fore-reef slope settings 
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Paleowater depth** 


0-10 m; 0-5 m* 


0-20 m; semi exposed, 0-10 m* 


0-30 m; 0-10 m* 


0-60 m; 0-10 m*, 20-102 m** 


0-10 m 


0-20 m 


>20 m 
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Hominin occupation of the Chinese Loess Plateau 
since about 2.1 million years ago 


Zhaoyu Zhu!*, Robin Dennell**, Weiwen Huang?*, Yi Wu, Shifan Qiu®, Shixia Yang*’, Zhiguo Rao®, Yamei Hou”, Jiubing Xie’®, 


Jiangwei Han'° & Tingping Ouyang!" 


Considerable attention has been paid to dating the earliest 
appearance of hominins outside Africa. The earliest skeletal and 
artefactual evidence for the genus Homo in Asia currently comes 
from Dmanisi, Georgia, and is dated to approximately 1.77-1.85 
million years ago (Ma)!. Two incisors that may belong to Homo 
erectus come from Yuanmou, south China, and are dated to 1.7 Ma’; 
the next-oldest evidence is an H. erectus cranium from Lantian 
(Gongwangling) —which has recently been dated to 1.63 Ma*— 
and the earliest hominin fossils from the Sangiran dome in Java, 
which are dated to about 1.5-1.6 Ma‘. Artefacts from Majuangou 
IID> and Shangshazui® in the Nihewan basin, north China, have 
also been dated to 1.6-1.7 Ma. Here we report an Early Pleistocene 
and largely continuous artefact sequence from Shangchen, which 
is a newly discovered Palaeolithic locality of the southern Chinese 
Loess Plateau, near Gongwangling in Lantian county. The site 
contains 17 artefact layers that extend from palaeosol $15—dated 
to approximately 1.26 Ma—to loess L28, which we date to about 
2.12 Ma. This discovery implies that hominins left Africa earlier 
than indicated by the evidence from Dmanisi. 

The Lantian area of the Shaanxi province lies on the southern margin 
of the Chinese Loess Plateau (Fig. 1), which covers about 270,000 square 
kilometres’ (Fig. 1). Loess deposition in China since 2.6 Ma has been 
massive, with up to 100-300 m being deposited on the Loess Plateau’. 
Typical aeolian loess—palaeosol sequences are divided into 33 couplets 
of alternating palaeosol (S) and loess (L) sediments from top to bot- 
tom, from the SO-L1 couplet to the $32-L33 couplet”*. This sequence 
records East Asian monsoon development since 2.6 Ma and correlates 
with marine isotope stage 1 to marine isotope stage 102 or 104%. Loess 
units correspond to even-numbered marine isotope stages (cold periods 
or glacial stages) and palaeosol units correspond to odd-numbered 
marine isotope stages (warm periods or interglacial stages). Numerous 
sections indicate that Loess Plateau sequences can be correlated over 
distances of >1,100km from west to east and >600 km from north to 
south. Palaeomagnetic dating, which establishes palaeomagnetic reversal 
boundaries, is the most reliable method for dating these sequences; 
for example, the Matuyama/Brunhes boundary occurs in the S7-L8 
couplet (and sometimes in S8); the Jaramillo subchron between 
L10 and S12; and the Olduvai subchron in the lower part of L25 to 
the base of $26”*!°-! (see Supplementary Information). Through 
palaeomagnetic dating and astronomical tuning'®", the age of each 
loess and palaeosol has been calculated in two schemes to within a 
precessional cycle (Supplementary Table 1) and used in international 
correlations of the Quaternary geological timescale’”'®; here we use the 
‘Chiloparts’ timescale’ (Supplementary Table 1). Because of questions 
about the positions of geomagnetic reversals in loess—palaeosol 
couplets’*!? and time delays of about 10-30 thousand years for the 


recording of polarity reversals in loess”°, we date each unit to within a 
precessional cycle rather than to the 1-thousand-year precision of the 
Chiloparts timescale. 

The main palaeoanthropological limitation of the Loess Plateau is 
that it largely lacks evidence of the Early Palaeolithic, or associated 
hominin remains. The main exception is Lantian county in which a 
mandible and cranium of H. erectus have been found at Chenjiawo 
and Gongwangling, respectively. Both were dated by palaeomagne- 
tism and correlation with the Luochuan loess—palaeosol sequence”!. 
At Chenjiawo the mandible occurs in palaeosol S6 (684-710 thousand 
years ago), and at Gongwangling the layer containing the cranium is 
now correlated with $23 (1.59-1.65 Ma)?. Although some stone arte- 
facts were found in Lantian county during the 1960s”, they could not 
be assigned precise geological contexts. A few artefacts have recently 
been dated to the middle-to-late Pleistocene**”». 

In our 2004-2017 investigations, we found a new Palaeolithic locality 
at Shangchen (34° 13’ 33’ N, 109° 28’ 39” E) about 4km north of 
Gongwangling. It is about 74m thick and complete from L5 to L28 
(equivalent to marine isotope stage 12 to marine isotope stage 80) 
(Fig. 2). After we found artefacts in loess and palaeosol sections, our 
main aim was to assign these a precise age. Particular attention was 
paid to evidence of multiple flaking, preferably from more than one 
direction; the presence of clear flake scars, percussion bulbs, retouch 
and secondary flake removals to distinguish artefacts from geofacts 
(Supplementary Information). In some sections, palaeomagnetic 
samples were taken next to where stone artefacts were discovered to 
confirm that they were in undisturbed deposits. Stone artefacts and 
mammalian fossils were also found at the base of $27 and the upper 
part of L28 in a small excavation in 2017. 

Preliminary classifications of loess—palaeosol horizons are based on 
the identification of marker layers (palaeosol $5 and loess horizons L9, 
L15, L24 and L25) during our field investigations. Grain-size, chem- 
istry and mineralogy analyses were performed to further distinguish 
sediment type (Extended Data Fig. 1). These indicate that the strata are 
aeolian, as are other typical Chinese loess deposits”*'*-!®, Rock magnetic 
measurements indicate that the sediments are suitable for palaeo- 
magnetic dating (Supplementary Information and Extended Data 
Fig. 2). High-resolution magnetic susceptibility curves, generated from 
1,076 samples with an approximately 6.9-cm average sampling interval, 
show variations that are the same as those of representative Chinese 
loess sections and confirm our pedostratigraphic identification. 
Systematic high-resolution palaeomagnetic measurements by progres- 
sive thermal demagnetization, based on 722 specimens with an approx- 
imately 10-cm average sampling interval, were used to determine the 
characteristic remanent magnetization direction, after removing 
one or two soft magnetization components (Extended Data Fig. 3). 
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Fig. 1 | Location map of the Chinese Loess Plateau and the Lantian 
area. The Qinling Mountains form the climatic and biological boundary 
between the temperate north and subtropical south of China. The 
locations of major hominin sites in China, including the Nihewan basin 
and Yuanmou, are indicated. The Lantian basin lies in the southern Loess 
Plateau and north of the Qinling Mountains in the temperate zone of north 
China with Palaeolithic localities such as Gongwangling, Chenjiawo and 
Shangchen in the Lantian area. The main section of Shangchen is situated 
on denudated loess tablelands at an altitude of 898 m, along the north 
bank of the Bahe River. Duanjiapo is one of the typical sections of the 
Chinese loess—palaeosol sequence. Localities marked as 1 and 2 refer to 
Gongwangling 1 and Gongwangling 2, respectively. 


The magnetic polarity for each specimen was used to establish the local 
magnetostratigraphy (see Fig. 3, Supplementary Information, Extended 
Data Figs. 4, 5 and Source Data), and from this precise ages have been 
assigned to the artefacts in the loess—palaeosol sequence. 

Eight major magnetozones are recorded in the Shangchen section 
(Fig. 3), four of which have normal polarity (N1 to N4) and four of 
which have reversed polarity (R1 to R4). By comparison with the geo- 
magnetic polarity timescale”*’’, magnetozone N1 corresponds to the 
Brunhes chron, and the Matuyama/Brunhes boundary (0.78 Ma) is 
in horizons S7 to L8; magnetozone N2 corresponds to the Jaramillo 
subchron (0.99-1.07 Ma) and lies between horizons S10 and $12; mag- 
netozone N3 corresponds to the Olduvai subchron (1.78-1.95 Ma), with 
its top and base lying within the lower part of L25 and the base of $26, 
respectively; and magnetozone N4 corresponds to the Réunion excur- 
sion (2.13-2.15 Ma) in L28. The four reversed polarity magnetozones 
are attributed to different periods of the Matuyama chron. A short-term 
polarity event, labelled e,, might correspond to the Punaruu excur- 
sion at about 1.12 Ma” (see Supplementary Information). Stratigraphic 
continuity and the position of reversal boundaries in the Shangchen 
section indicate that the loess—palaeosol sequence is complete from L5 
to L28 through a date range of approximately 0.41-2.16 Ma'®””. By using 
marker layers and palaeomagnetic reversal boundaries of offset sec- 
tions”? (see Supplementary Information and Extended Data Figs. 4, 5), 
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Fig. 2 | The landscape and loess-palaeosol section of the Shangchen 
Palaeolithic locality. The steep and dissected terrain exposes an aeolian 
loess—palaeosol sequence from L5 (412-479 thousand years ago) to 

L28 (2.11-2.13 Ma) that is fully developed, particularly with marker 
horizons such as the tripartite S5 with the reddest colour and highest 
magnetic susceptibility or the units L9 (869-943 thousand years ago), 
L15 (1.24-1.26 Ma) and L24-L25 (1.65-1.80 Ma), which represent 
particularly thick and silty loess (with coarse grain size) with a notable 
yellowish-white colour and lowest magnetic susceptibility. On the 

basis of these marker horizons, palaeomagnetic sampling and artefact 
collecting have been divided into five subsections along the continuously 
developed gully, and the main stratigraphic sequence has been established 
(see Supplementary Information and Extended Data Figs. 4, 5 for details). 
Many artefacts have been found in situ within the sampled area. 


the comprehensive main section and timescale of the Shangchen locality 
have been established from five subsections. 

At Shangchen, 88 flaked and 20 unmodified stones were found in situ 
between $5 and L28. Here we discuss only those found in situ (n = 96) in 
layers S15 to L28, which date from 1.26 Ma to 2.12 Ma. The 82 flaked and 
14 unmodified stones between S15 and L28 (Supplementary Table 6) 
were collected from 6 loess and 11 palaeosol strata within trenches cut 
for palaeomagnetic sampling, in situ at distances of less than 100m from 
these trenches and exposed in the surfaces of 50-80° slopes, or within 
a 15.4-m? excavated area of $27-L28 (see Supplementary Information 
and Extended Data Figs. 6-10). Five categories of flaked stone were 
recognized: cores (17), flakes (9), flake fragments (8), fragments (14) and 
retouched pieces (34). The 34 retouched items were classified as scrapers 
(17), notches (4), scraper and/or notches (4), points and/or borers (6) 
and picks and/or bifaces (3). Cores were produced by bipolar (n= 2) 
and hard hammer percussion (n = 15), and those on elongate cobbles 
(n=9) were most common. The average number of flakes per core was 
4.6 (range of 1-7), from a mean of 2.3 directions (range of 1-6). As no 
natural stone exposures (for example, stream beds or outcrops) were 
encountered, isolated unmodified cobbles are regarded as manuports 
that may have been intended as cores or used as hammerstones. Two 
pieces identified as hammerstones show percussion damage. All arte- 
facts except one were fresh, with no sign of rolling or abrasion. The most 
common raw materials (Supplementary Table 6) are different types of 
quartzite (n =72) and quartz (n = 20) and their probable source is the 
foothills of the Qinling Mountains—which are situated 5-10 km to the 
south of the site—and the streams flowing from them. The horizon 
with the most stone items is $22 (n = 33, 1.54-1.57 Ma) (Supplementary 
Table 8 and Extended Data Fig. 6). A total of 28 stones (25 flaked and 
3 unmodified) were found below S22 (Fig. 4). In S23 (1.59-1.65 Ma) a 
flake and a flake fragment were found; in $24 (1.71-1.73 Ma), a notch, 
scraper, core and two flake fragments; in L25 (1.73-1.80 Ma), a scraper, 
hammerstone, and two fragments; in $26 (1.89-1.95 Ma), a cobble; in 
L27 (1.95-2.09 Ma), a cobble and a fragment; and in $27 (2.09-2.12 Ma) 
(Extended Data Fig. 7), six artefacts were found (three cores, a core 
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Fig. 3 | Summary of the loess—palaeosol section with artefact sequence 
of the Shangchen Palaeolithic locality. Magnetic susceptibility variations 
(high values in palaeosols and low values in loess), and the positions of key 
marker layers (S5, L9, L15 and L24—L25) are the same as in the standard 
Loess Plateau sections”*'*"!®, The geomagnetic polarity timescale is from 
previous publications*®””. Details of the palaeomagnetic data and artefacts 
within 17 excavation layers (S15, $16, $18-S24, $26 and S27, and L17, 

121, L22, L25, L27 and L28) are provided in Supplementary Information, 
Extended Data Figs. 5-10 and Source Data. Artefacts found above S15 

are not included in this paper because here we focus on the artefacts 
earlier than about 1.2 Ma. There are four normal polarity magnetozones, 
comprising N1 (chron Cln, 0-9.71 m), N2 (Clr.1n, 23.50-26.90 m), N3 


and/or scraper, a pointed piece and a flake tool). In the excavation into 
S27 and L28 (Extended Data Figs. 8, 9), a core and a flake tool were 
found at the S27/L28 boundary, and six artefacts were found in L28 
(approximately 2.12 Ma): three scrapers, a point, a pointed piece and 
a flake. A mandibular fragment of a cervid, a bovid and other fossil 
bone fragments were also found (Extended Data Fig. 10). Using the 
Chiloparts timescale, the stone artefact and fossil horizon in the upper 
part of L28 between the Olduvai subchron and the Réunion excursion 
is dated to about 2.12 Ma. Details of artefact features are indicated in 
Supplementary Tables 6-8. Images of artefacts being extracted from 
original strata are shown in Extended Data Figs. 6-9, and laser three- 
dimensional scans of three pieces are shown in Supplementary 
Videos 1-3. All artefacts in these figures have clear flake features and 
evidence of repeated flaking from more than one direction. They are 
also consistent with the rest of the artefact sequence from Shangchen 
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(C2n, 62.94-68.44 m) and N4 (C2r.1n, 73.08-73.41 m). Four reversed 
polarity magnetozones, comprising R1 (9.71-23.50 m), R2 (26.90- 

62.94 m), R3 (68.44-73.08 m) and R4 (73.41-74.05 m), are attributed to 
different periods of the Matuyama chron. In addition, as an independent 
check on the Chiloparts timescale, we have estimated the ages of the 
main layers by using the average sedimentation rate of each segment to 
obtain the sedimentation rate age (see Supplementary Tables 2-5). The 
age difference between our sedimentation rate age and the Chiloparts 
timescale is small (~2.16%) and we therefore use the internationally 
recognized Chiloparts timescale throughout this paper. Dec., declination; 
GPTS, geomagnetic polarity timescale; Inc., inclination; VGP, virtual 
geomagnetic pole. 


with respect to raw materials, flaking techniques and type of flaking 
product. 

Four aspects of our discoveries at the Shangchen Palaeolithic locality 
are noteworthy. First, we have for the first time—to our knowledge— 
established the age of Early Pleistocene (below L15) artefacts in the 
Loess Plateau within a rigorous stratigraphic framework. Second, stone 
artefacts were found predominantly in 11 palaeosol layers (S15, S16, 
$18-S24, $26 and S27, with 80 items), which developed in a warm 
and wet climate, but in only 6 loess layers (L17, L21, L22, L25, L27 
and L28, with 16 items), which indicate colder and drier conditions 
(Fig. 3). This pattern is consistent with that found in the loess—palaeosol 
sequence in Tajikistan*”. Third, the 17 loess—palaeosol layers with 
artefacts span about 0.85 million years and indicate repeated—but not 
necessarily continuous—hominin occupation of the Chinese Loess 
Plateau between 1.26 and 2.12 Ma, equivalent to marine isotope stage 
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Fig. 4 | Selected artefacts found in situ in layers $27-L28 (2.09- 

2.12 Ma), L27 (1.95-2.09 Ma), L25 (1.73-1.80 Ma) and $23 (1.59- 
1.65 Ma) from the Shangchen Palaeolithic locality. a, SC 20120507-3, 
a flake tool from $27. b, SC 20120507-2, a pointed piece from $27. c, SC 
20120516-1, a core from $27. d, SC 20120507-1, a core from $27. e, SC-B 
D2-2, a bipolar fragment from L27. f, SC 20120502-6, a flake from $23. 


39 to marine isotope stage 80. Fourth, and most importantly, the oldest 
artefact age of approximately 2.12 Ma at Shangchen implies that hom- 
inins had left Africa before the date suggested by the earliest evidence 
from Dmanisi (about 1.85 Ma). This makes it necessary to reconsider 
the timing of initial dispersal of early hominins in the Old World. 
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g, 2010-06 SC-L25, a scraper from L25. h, SC-K5, a flake tool from 
$27-L28. i, SC-K4, a core from $27-L28. j, SC070926-1, a flake fragment 
from S23. Artefacts from S23 are similar in age to the the Gongwangling 
H. erectus cranium’. Each gradation in the scales represents 1 cm. 
Artefacts SC-K4 and SC-K5 are also shown in Extended Data Fig. 10. SC, 
abbreviation for Shangchen site. 
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Extended Data Fig. 1 | Material composition of aeolian deposits in the 
Lantian area. a, b, Grain size distributions for loess (a) and palaeosol 

(b) samples from the Shangchen section. The features of sediments in 
Shangchen section are the same as those from the Lingtai section*!, 

which is one of the typical Chinese loess sections. This indicates that 
sediments from the Shangchen section are aeolian deposits that are similar 
to those at Lingtai. c, Comparison of major elements between the loess 
sediments at Lantian and Luochuan’. The x and y coordinates represent 
logarithmic values of the major element content of loess from Luochuan 
and Lantian, respectively, and the plot shows that loess from the two sites 
is geochemically similar (data fall on or close to the 1:1 line). d, Chondrite- 
normalized rare-earth element distribution patterns for loess and 
palaeosol samples from the Shangchen section. Characteristics of partition 
modes of rare-earth elements from loess and palaeosol samples in the 
Shangchen section indicate they are the same as those from the Luochuan 
section. 


31. Sun, Y.B., Lu, H. Y. & An, Z. S. Grain size distribution of quartz isolated from 
Chinese loess/paleosol. Chin. Sci. Bull. 45, 2296-2298 (2000). 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


LETTER 


is 3.40 m, S5 12.00 m, L8 35.05 m, L15 68.75 m, S26 
ro 1 1 
N Ss 
3 Ss 
c 
D 
o 
€ 
3 
oO 
in 
o 
£ 
S 
z 
0 200 400 600 0 200 400 600 0 200 400 600 0 200 400 600 
Temperature (C) Temperature (C) Temperature (C) Temperature (C) 
b 

§ 1')2.02m,s5 36.60 m, L15 54.30 m, S22 ") 69.28 m, $26 69.98 m, L27 
G 
N 
3 0.5: 0.5: 
c 
gz 
£ 0 0 
a] . B,, = 46.04 mT 
o B= 20.61 mT B, = $9.22 mT B, = 35.94 mT B., = 58.99 mT 2 
= 0s Hs B, = 18.96 mT B,= 12.01 pi 08 B. = 18.98 mT B.=1432mT 
¢ M,, = 6.95"104 Am? M,,=5.61*104 Am? M,,=6:31°104 Ame Meee 10 1Ne 
io} M, = 3.92"10% Am? M, = 3.3610? Am? M, = 3.53102 Am? M, = 3.54°10° Am? 70*103 Am? 
Zz ot -1 

-600 -250 0 250 500 -500 -250 0 250 500 -500 -250 10) 250 500 -500 -250 0 250 500 -500 -250 0 250 500 -500 -250 0 250 500 

Field (mT) Field (mT) Field (mT) Field (mT) Field (mT) Field (mT) 
c d 
0 0 
1.0167 
. 
1.012 
e 
L 1.008} ‘ 90 
*e ° . 
ri s 
1.004 o, 
ee ? : 
to o° 
ae quet. » 
Ee y Sms 
1 T 7 m Lower 
4 1.005 1.01 1.015 1.02 Upper 
F 180 


Extended Data Fig. 2 | Rock magnetism. a, Temperature-dependent 
magnetization variations for four representative samples from the 
Shangchen section. Arrows indicate heating or cooling runs. These curves 
were obtained in air using a field of 100 mT, and indicate the presence of 
maghemite, dominant magnetite and haematite. This type of magnetic 
mineral assemblage is typical of that previously recovered for the Chinese 
loess—palaeosol sequence’®*”. b, Hysteresis loops for representative 
samples after correction for paramagnetic slope. M,, saturation 
magnetization; M,,, saturation remanence; B,, coercive force; and B,,, 
coercivity of remanence. c, Anisotropy of magnetic susceptibility for 694 


32. Liu, Q., Deng, C., Torrent, J. & Zhu, R. Review of recent developments in mineral 
magnetism of Chinese loess. Quat. Sci. Rev. 26, 368-385 (2007). 


specimens from loess L5 to loess L28 in the main section that we studied, 
and parallel sections at the Shangchen locality. Left, Flinn diagram. 

L, lineation (Kmax/Kint)- F foliation (kint/Kmin). Right, stereographic 
projections of anisotropy-of-magnetic-susceptibility ellipsoids of 
specimens. Data for Kmax and kmin are shown as black squares and red 
circles, respectively. The data are indicative of normal, undisturbed 
sedimentary fabrics and support the magnetostratigraphic interpretation 
presented in this study. d, Equal area projections for the natural remanent 
(left) and characteristic remanent (right) magnetization directions for all 
694 studied specimens. 
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Extended Data Fig. 3 | Palaeomagnetic dating. a~-o, Demagnetization coordinate axes are in degrees Celsius and mA m|, respectively. In m, n, 
diagrams for representative specimens from the Shangchen section. the specimens from the magnetic horizon of the Réunion excursion are 
Solid and open symbols refer to data projected onto the horizontal and shown. See Source Data for detailed listings of palaeomagnetic data from 
vertical planes, respectively. Scales of demagnetization temperature and the Shangchen section. 
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Extended Data Fig. 4 | See next page for caption. 
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Extended Data Fig. 4 | Landscape in which palaeomagnetic sampling 
and artefact collecting were carried out at the Shangchen locality. 

The establishment of magnetostratigraphy in the Shangchen locality was 
based on the traditional and effective-linking methods (using marker 
layers and palaeomagnetic reversal boundaries of offset sections”’). The 
comprehensive main section and timescale of the Shangchen locality have 
been established from five subsections, the spatial distribution of which is 
shown below. a, b, The main sections are exposed continuously along the 
same gully. Because of the steep terrain and multistep gully bottom (see c, 
d), we divided the sampling into four subsections (offset sections) in this 
gully. Several marker layers—such as L9, L15 and L25—that are white and 
thick are obvious in the section at short range and can easily be used to 
link the subsections. The subsections I, II, III and IV contain the following 
layers: I (L5-L15), II (L15-L25), MI (L25-L27), and IV (L27-L28, on the 
other side of the slope). c, The steep subsection II with fresh outcrops 

of original loess and palaeosols. Many artefacts were found within more 


LETTER 


than 10 horizons in subsection II or on both its sides. The first artefact 
was found here in S22. Note the two individuals on the upper part of the 
section as an approximate scale. d, Subsection V, which is a short parallel 
section opposite subsection I] (c). e, Many artefacts were found in situ 

in palaeosol $22 during our sampling and excavation. Palaeomagnetic 
analysis confirms that the artefacts were collected from undisturbed loess 
or palaeosol (see Supplementary Information). The steepness of slope 
(shown in c, d) prevents large-scale excavation. More details can be seen 
in Extended Data Fig. 6. f, g, The section of the exploratory trench (named 
subsection KW) contains layers L25-L28. It is part of the same small hill as 
the main composite section (subsections I-IV), and is located about 500m 
northeast of palaeomagnetic section IV (see Extended Data Fig. 5a). Based 
on the outcrops of marker layers L9, L15 and L25 in the loess—palaeosol 
sequence, these sections can easily be linked (see Extended Data Fig. 5 for 
further details). 
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Extended Data Fig. 5 | Distribution and linking method of sections 

at the Shangchen locality. a, Schematic plan of the subsections of 
palaeomagnetic sampling. The numbers 800 and 900 refer to the height in 
metres above sea level. The contour interval is 20 m. Subsection KW refers 
to the additional parallel section from L25 to L28 with the excavation into 
$27 and L28. b, Schematic of the method used to link the five subsections 
in the Shangchen locality. This figure and Extended Data Fig. 4 show the 
loess—palaeosol stratigraphy, with key marker layers, sampled subsections 
and the locations where artefacts were found. The details of the method 
used to link the five subsections are given below. Subsections I and II 

are linked on the basis of the L15/S15 boundary. Measured horizons of 
subsection I start from the mid-to-lower part of L5 and run downward 

to the base of marker layer L15 (to the boundary of L15/S15). Measured 
horizons of subsection II start from the lowest part of L15 and run 
downward to the lower part of L25. Because the measured thickness of 
marker layer L15 is the same in both subsections I and II, we cut out the 
overlapped L15 in subsection II. Thus, subsection I ends at the base of 
L15, and subsection II begins at the top of $15. Subsections II and II are 
linked on the basis of the top boundary of the Olduvai subchron. A small 
normal-polarity segment of the Olduvai subchron was detected in the 
marker layer L25 at the base of II. Measured horizons of subsection III 
start from the lower part of L25 to the top of L27, and record the entire 
Olduvai subchron with a small reversed-polarity segment on the top (in 
L25) and at the base (top of L27) of the subsection. Thus, the overlapped 
normal-polarity segment of the Olduvai subchron at the base of subsection 
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II and a small reverse-polarity segment on top of subsection III were cut 
out. The reverse-polarity segment at the base of subsection II is linked to 
the normal-polarity top segment of subsection III, and the whole normal- 
polarity segment of the Olduvai subchron is contained within subsection 
III. Subsections III and IV were linked on the basis of the base of the 
Olduvai subchron. Measured horizons of subsection IV start from the 

top of L27 to the mid-to-upper part of L28, with a small normal-polarity 
segment of the base of the Olduvai subchron at the top. After cutting out 
both the small reverse-polarity segment at the base of subsection III and 
the small normal-polarity segment on top of subsection IV, the reverse- 
polarity segment just below the Olduvai subchron in subsection IV is 
linked to the normal-polarity base of the Olduvai segment in subsection 
III. Subsection KW contains horizons of L25 (marker layer) to L28, 
including units L25, $25, L26, $26, L27, $27 and L28. Against the general 
background of reversed magnetostratigraphy, two normal magnetozones— 
the Olduvai subchron (1.78-1.95 Ma) and the Réunion excursion 
(2.13-2.15 Ma)—are recorded. Therefore, section IV and section KW are 
linked by the base of $27 (that is, $27 at the base of section IV is linked to 
L28 at the top of subsection KW). Subsection V is opposite of subsection 
II, which is on the other side of a narrow gully and has two distinct 
marker layers, L15 in the upper half and L25 in the lower half. There is a 
palaeosol layer with many artefacts in the middle part of subsection V. The 
stratigraphic correlation and horizon number demonstrate that this layer 
belongs to $22. Palaeomagnetic measurements on samples collected from 
this parallel section all showed reversed polarity. 
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Extended Data Fig. 6 | See next page for caption. 
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Extended Data Fig. 6 | Stone artefacts found during the sampling of 
$22 (1.54-1.57 Ma) and $24 (1.71-1.73 Ma) at the Shangchen section. 
a, First discovery of a stone artefact in $22 (‘place I’). b, Schematic profile 
of the section-cutting of place II. c, The place II section was excavated into 
the original palaeosol layer to a depth of about 150 cm. Note the presence 
of stone artefacts on a ledge in the original palaeosol, below the weathered 
crust. d, View of the excavation in subsection V. e, Profile of the original 
slope surface with a weathered crust of only 3-6-cm thickness, and no 
slope wash. Palaeomagnetic analysis of the samples taken around the 
stone artefacts confirmed that they were in undisturbed deposit. f, View 
of the excavation in $24. Note the weathered surface crust upslope, the 


slope wash to the left of the cutting and section cleaning debris below the 
excavation. g, Schematic profile of excavation into $24. h, Side view of 
place I of the section cut by the excavation, showing two artefacts within 
fresh original palaeosol. Note that there is a clear difference between the 
thin weathered crust and the original loess. i-k, Discovery and exposure 
of a large quartzite core in place II of the section (i, j), which led to the 
removal of the core shown in k. Details of the large core (SC080710-1) are 
given in Supplementary Table 6. Analysis of the palaeomagnetic samples 
taken alongside the stone artefacts showed that the loess and palaeosol was 
undisturbed, and did not represent slope wash. 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


slope wash 


We have founda 
new stone! After 
cutting into~10 cm 
of the palaeosol, 
anew small stone 
was found in situ in 
the original layer. 


d 


Mark the stone 
and start to dig. 
The outcrop is 

only 7 cm long 

now. 


Hf 
Ait 


Take the stone 
out carefully 


layer. 


Itis a new artefact. 

A flake (SC20120507-3) 
9.2x5.3x 3.2 cmin size. 
Itis embedded 7 cm into 
the original layer. 


Extended Data Fig. 7 | Details of artefacts excavated from the roadside 
section at the Shangchen locality from S27, which occurs below the 
Olduvai subchron and is dated to approximately 2.1 Ma. The hill slope 
is covered by a dark brown slope wash of 10-30-cm thickness that is 
easily distinguishable from the in situ loess in the roadside section. After 
cleaning the section, stone artefacts were exposed between 1.5 and 3m 
below the top of the section, and extracted from in situ loess. a, b, This 
shows the obvious difference between slope wash and the original loess- 
palaeosol layer. b, First finding of a stone standing out of the surface crust 


from the original 
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roadside section 


First surprising finding: 
asmall outcrop of stone in the vertical roadside section. 


Mark the stone and photograph: 
the outcrop is only 7 X 3.5 X 2cm in size. 


Continue 
to dig 


Oh, it is a real stone artefact 


Take it out 


The large core 
(SC20120507-1) 
with a size of 
15.2x14.5x8.3 
cm”, i.e. itis 
embedded 

11.7 cmin the 
original layer. 


which is situated about 1.5 m below the top of slope wash. c-f, The finding 
and collecting process for the artefact is the same in Extended Data Fig. 6. 
Details of the artefact SC 20120507-3—which was found after digging 
>30 cm into the fresh original palaeosol layer—are given in Fig. 4a, 
Supplementary Tables 6, 7 and Supplementary Video 1. g-i, A stone is 
marked with its specimen number before removal (g), and the artefact is 
disclosed step by step (h, i). j, The stone artefact after removal. k, View of 
core (SC20120507-1). Clear flake scars are evident. This piece is shown in 
Fig. 4d and details are given in Supplementary Tables 6, 7. 
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Extended Data Fig. 8 | The stratigraphic partition, grid layout 

and distribution of artefacts and fossils in the exploratory trench 
(subsection KW) in $27 and L28. a, b, The vertical section (a) and the 
horizontal plane (b) of the exploratory trench, which is composed of a 
primary and a secondary trench. The primary exploratory trench is 6m 
long, 1.4-1.78 m deep and 2 m wide, and the secondary trench is 1.7m 
long, 2m deep and 2 m wide. The excavated area (approximately 15.4m”) 
was divided into a grid of 1-m x 1-m squares. I, II, the numbers of grid 
lines on the vertical section. Al-A6 and B1-B6, numbers of the grid 
squares on the horizontal plane in the primary trench. -A1, -B1, the 
numbers of grid lines on the horizontal plane in the secondary trench. 
In the south and the north of the primary trench, 108 palaeomagnetic 


horizontal Plane 


b i 


primary trench 0 1m 


palaeomagnetic 
sampling trench 


samples were continuously collected in two sampling trenches. The 

north sampling trench is 0.8-1 m wide and 2.3 m deep, and the south 
sampling trench is 0.4m wide and 1.2 m deep. Some stone artefacts and 
fossil remains were excavated in situ in S27 and L28 from our exploratory 
trench. The features of eight artefacts can be seen in Fig. 4, Extended Data 
Fig. 10 and Supplementary Table 6. The fossils from the excavation were 
primarily tooth and bone fragments (Extended Data Fig. 10) of different 
animals, including a cervid, a bovid and a suid. No bones or jaws were 
complete. The fossil remains and stone artefacts occurred on the same 
horizon and were close to one another. Note that the artefacts and fossils 
also occur in the deepest part of the trench and not near the modern slope 
surface. 
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Extended Data Fig. 9 | The excavation of the exploratory trench any crops planted at this time). Underlying the cultivated layer are clear, 
(the subsection KW) in layers $27 and L28 at Shangchen locality. homogeneous, solid and undisturbed loess—palaeosol strata. Several stones 
a-c, The primary (a, c) and secondary (b) exploratory trenches. and fossils were found in situ in the original palaeosol-loess strata (S27- 
d, e, Palaeomagnetic sampling trench in the north of the primary 128). These stones and mammalian fossils were kept in the reserved pillars 
exploratory trench. The boundary between the cultivated horizon of palaeosol and loess during the exploration process (a—c). The Réunion 
(cropland soil) and original loess horizon (S27) is very clear. The top excursion is situated in the lower part (L28) of the palaeomagnetic 

layer is a grass-covered cropland soil of a depth of 20-40 cm (without sampling trench (e). 
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Extended Data Fig. 10 | See next page for caption. 
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Extended Data Fig. 10 | Selected artefacts and fossils from the 
exploratory trench (subsection KW) in $27 and L28. These are the oldest 
artefacts discovered at the Shangchen locality during this investigation. 

a, Quartzite core (SC-K4) from the boundary between $27 and L28 at 

a depth of 60-70 cm below the boundary between cultivated soil and 

fresh original palaeosol, and 164cm above the end of the Réunion 
excursion. See Fig. 4i and Supplementary Tables 6, 7 for details. b, c, The 
oldest artefacts SC-K30 and SC-K55 within L28 from the exploratory 
trench, from 69-114 cm above the end of the Réunion excursion. See 
Supplementary Table 6 for details. d, Quartzite flake tool (SC-K5) from the 
exploratory trench, from the boundary between S27 and L28 at a depth of 
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about 75 cm below the boundary between cultivated soil and fresh original 
palaeosol and 149 cm above the end of the Réunion excursion. See Fig. 4h 
and Supplementary Tables 6, 7 for details. e, Flake tool SC-K54 from 
within the L28 from the exploratory trench. f, Mandibular fragment of a 
small bovid (SC-K60) from the same horizon of S27 as the stone artefacts 
found during the excavation of S27 and L28. Top image shows the occlusal 
view of the mandible. g, A left mandibular fragment of a large cervid 
(SC-K10), which was found near and in the same horizon of $27-L28 

as the stone artefacts SC-K4 and SC-KS5. In the scales, each division 
represents 1 cm. 
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Triple oxygen isotope evidence for limited 
mid-Proterozoic primary productivity 


Peter W. Crockford!*+*, Justin A. Hayles*+>, Huiming Bao®°, Noah J. Planavsky’, Andrey Bekker®, Philip W. Fralick®, 
Galen P. Halverson!, Thi Hao Bui!, Yongbo Peng® & Boswell A. Wing!° 


The global biosphere is commonly assumed to have been less 
productive before the rise of complex eukaryotic ecosystems than 
it is today'. However, direct evidence for this assertion is lacking. 
Here we present triple oxygen isotope measurements (A!’O) from 
sedimentary sulfates from the Sibley basin (Ontario, Canada) 
dated to about 1.4 billion years ago, which provide evidence for a 
less productive biosphere in the middle of the Proterozoic eon. We 
report what are, to our knowledge, the most-negative A!70 values 
(down to —0.88%o) observed in sulfates, except for those from the 
terminal Cryogenian period”. This observation demonstrates that 
the mid-Proterozoic atmosphere was distinct from what persisted 
over approximately the past 0.5 billion years, directly reflecting a 
unique interplay among the atmospheric partial pressures of CO, 
and O; and the photosynthetic O2 flux at this time®. Oxygenic 
gross primary productivity is stoichiometrically related to the 
photosynthetic O, flux to the atmosphere. Under current estimates 
of mid-Proterozoic atmospheric partial pressure of CO, (2-30 
times that of pre-anthropogenic levels), our modelling indicates 
that gross primary productivity was between about 6% and 41% of 
pre-anthropogenic levels if atmospheric O2 was between 0.1-1% or 
1-10% of pre-anthropogenic levels, respectively. When compared 
to estimates of Archaean*° and Phanerozoic primary production’, 
these model solutions show that an increasingly more productive 
biosphere accompanied the broad secular pattern of increasing 
atmospheric O, over geologic time’®. 

Modern primary producers perform oxygenic photosynthesis, which 
provides O, to the atmosphere and fixes the carbon that fuels hetero- 
trophic consumption in the global biosphere. In the low-O, mid-Prote- 
rozoic world$, this oxygenic gross primary productivity (GPP) has been 
widely assumed to be less than GPP in more recent times!. Accordingly, 
mechanistic explanations for low-O Proterozoic surface environments 
focus on how to limit the productivity of the Proterozoic biosphere?" 
despite the lack of direct empirical evidence for lower Proterozoic GPP. 
Indeed, proposals for sustaining Proterozoic environmental stasis!*~' 
would be strengthened if their key component could be validated from 
the geologic record. Here we examine whether the mid-Proterozoic 
biosphere was less productive than more recent ones by using a triple 
oxygen isotope proxy for GPP in sulfate minerals that are dated to about 
1.4 billion years ago (Ga). 

A record of the productivity of the biosphere is embedded within the 
mass-independent oxygen isotope fractionation (O-MIF) carried by 
tropospheric O2. Stratospheric photochemical reactions preferentially 
concentrate !’O and '8O in O3 and leave ©O anomalously enriched in 
the residual O2, which therefore has a negative A 170 value 
(A170 = 870 - 0.5305 x 8!8O). Isotopic exchange between strato- 
spheric CO and heavy oxygen atoms from O3 photolysis imparts a 
positive A!’O value to carbon dioxide!>. Mass balance dictates that O2 
exiting the stratosphere carries a negative A'7O value and that the mag- 
nitude of this anomaly is proportional to the atmospheric partial pres- 
sure of CO2 (p . ao In the troposphere, stratospheric O2 mixes with 


the O, produced through photosynthesis—which carries '°O, '7O and 
‘80 in isotopically normal proportions—and a near-zero A'’O value 
is set by the isotopic composition of water!’”. The A!”O value of tropo- 
spheric O, reflects the relative contributions of O2 supplied from the 
stratosphere versus O, derived from photosynthesis!”"8. 

This balance is only interpretable if there is enough atmospheric O2 
to sustain a sizable ozone layer: that is, atmospheric partial pressures 
of O2 (p o,) greater than about 0.1% of pre-anthropogenic levels (PAL) 


(100% PAL = 209,500 p.p.m. O2)!. Although stratospheric oxygen 
dynamics do not permit the production of O-MIF below this Poy such 
atmospheric anoxia supports the photochemical production of sul- 
fur-isotope mass-independent fractionation (S-MIF). As seen in the 
Archaean rock record, the presence of S-MIF and absence of O-MIF 
provides a key metric with which to identify a strictly anoxic atmos- 
phere”? (Extended Data Table 1). Because photosynthetic carbon 
fixation is proportional to O2 production through the photosynthetic 
quotient, at higher Po, the A’’O anomaly in tropospheric O, is a direct 


measure of GPP!”. With independent estimates of contemporaneous 
Pco, and sufficient Pop the A!”0 value of ancient atmospheric 


O2 can constrain ancient “GPP”, 

Sedimentary sulfate minerals can preserve a record of ancient atmos- 
pheric O2. Oxidative weathering of sulfide minerals incorporates a 
sizeable a of oxygen from tropospheric O, into product sulfate 

(denoted f, ). Biologically mediated pyrite say which dominates 
abiotic oxidation in natural environments, produces a range of f, 
between 8 and 15%”. The A!”O value of aqueous sulfate is also influ- 
enced by the ratio of sulfide to sulfate minerals in the source rocks 
undergoing oxidative weathering, as well as the intensity and style of 
microbial sulfur cycling in the aqueous environment”™*, which erases 
the A’’7O signal. For example, modern marine sulfate carries a muted 
but resolvable negative average A'7O value? (approximately —0.08%o; 
Extended Data Table 1) inherited from modern tropospheric O» 
(approximately —0.55%o”°). Terrestrial sulfate-rich evaporative settings 
are more conducive than marine settings to the rapid precipitation of 
gypsum, anhydrite or other sulfate salts”®, and thus both minimize the 
isotopic consequences of microbial recycling”’ and capture newly 
formed sulfate with a A'’O value that more directly reflects tropo- 
spheric Op. 

The Rossport Formation of the Sibley Group in Ontario (Canada) 
contains lacustrine and sabkha sediments with abundant gypsum 
layers and nodules. These sulfate-rich sediments are intruded by 
1.1-billion-year-old dykes, and have an estimated depositional age of 
about 1.4 billion years”* (Supplementary Information). We measured 
the oxygen (6!8O and A’”O) and sulfur (64S and A°?S) isotope compo- 
sitions of sulfate that was chemically extracted from 68 Sibley samples. 
We found A!7O values that were more negative than those of modern 
and Phanerozoic marine sulfates as well as that of modern tropospheric 
O> (Fig. 1), and no evidence of S-MIF (Extended Data Table 1). Our 
results unambiguously provide a near-direct record of mid-Proterozoic 
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Fig. 1 | Triple oxygen isotope data over the past 1.5 billion years. 
Compiled A’’0 data in barites (white diamonds)?*?-”, carbonate- 
associated sulfate (CAS) (grey triangles)??? and evaporites (grey or red 
circles). Analytical uncertainty on A'’O measurements (10) is less than 
0.05%o. Results from this study (n = 68) are compared to average A!”O 
values of modern marine sulfate?*4 (dark-blue dashed line) and modern 
tropospheric oxygen” (light-blue dashed line). Inset is a histogram of 
A170 values from the samples analysed here. Ma, million years ago. 


atmospheric O, and indicate that the bulk atmosphere has existed in 
three isotopically distinct states over Earth’s history: S-MIF but no 
O-MIF (Archaean), O-MIF but no S-MIF (Proterozoic), and no sub- 
stantial O-MIF or S-MIF (Phanerozoic)””° (Extended Data Table 1). 
Unlike syn-glacial Cryogenian sulfate (Extended Data Fig. 1), the 
low 6*4S values, small negative A**S values and high 8'8O values 
reported here suggest weak microbial sulfur cycling and limited sulfide 
reoxidation”®*” in the Sibley basin (Fig. 2). The A’’O values of the 
Sibley Group sulfates, therefore, are primarily modulated by the amount 
of O2 incorporated during sulfide oxidation rather than by the 
weathering and redeposition of pre-existing sulfate minerals or intense 
reoxidative microbial sulfur cycling (Fig. 2). Considering probable fo, 


values for pyrite oxidation (8-15%*), the most negative Sibley A'7O 
value (—0.88%o0; Extended Data Table 1) indicates that coeval tropo- 
spheric O had a A!70 value between —11 and —6%o. This range is 
about 10 to 20 times larger than the A'7O of modern tropospheric O2, 
which demonstrates that the mid-Proterozoic atmosphere bore little 
resemblance to the modern atmosphere with respect to the magnitude 
of its stratospheric and photosynthetic O2 fluxes as well as the size of 
the Oy reservoir in which these fluxes competed. 

Although mid-Proterozoic A'7O and GPP cannot be calibrated from 
modern values”, isotope mass balance calculations indicate that the 
A'70 value of tropospheric O, reflects three key variables—p co, Po, 
and GPP—and that parameters such as troposphere-stratosphere 
exchange exerted considerably less influence’. Under near-modern 
GPP, as may have characterized the immediate aftermath of the termi- 
nal Cryogenian glaciation*', ultra-high atmospheric Poo, (more than 


350 times PAL; 100% PAL= 280 p.p.m. CO2) is the only viable ies e 
impart a sizeable negative A'7O value to tropospheric oxygen”? 
However, in conditions of lower Poo, similarly negative A'7O values 
can be generated if GPP is greatly diminished, which effectively lengthens 
the residence time of photosynthetic O2 in the atmosphere’. We used 
this isotope mass-balance approach? together with independent esti- 
mates for mid-Proterozoic levels of p,, 0, and Po, t show that the A'70 
values of the Sibley sulfate provide empirical evidence that limited 
mid-Proterozoic primary productivity was a biogeochemical reality. 
Estimates of mid-Proterozoic Poo, from geochemical proxies and 
climate models suggest Poo, between 2x and 30x PAL at 1.4 Ga 
(Extended Data Fig. 2). Mid-Proterozoic O2 estimates span a wider 
relative range but fall between 0.1 and 10% of PAL (Extended Data 
Fig. 2). We take an agnostic perspective of the techniques for estimating 
Po, and p,, 0; and use uniform frequency distributions within these 
ranges as the basis for a Monte Carlo resampling of mid-Proterozoic 
O2 and CO; levels. Along with a resampling of other control parameters 
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Fig. 2 | Sulfur and oxygen isotope constraints on limited microbial 
sulfur cycling in the Sibley basin. a, 5*4S and A'70 values exhibit a 
weak positive correlation (n = 64; r? = 0.132 (Pearson’s)), which shows 
that microbial sulfur cycling was not effective in removing anomalous 
A'0 values. b, 54S and A*°S values indicate a sulfur cycle dominated by 
dissimilatory sulfate reduction, (DSR) (grey field)***> with no evidence 
for a reoxidative flux into the sulfate pool (f;o, fraction of sulfide that is 
reoxidized to sulfate). c, Steeply aligned 6**S and 5180 values indicate 
relatively slow sulfate reduction rates (SRR)**. Total analytical uncertainty 
on §4S, A*3S and §!8O measurements is estimated at 0.1, 0.01 and 0.5%o, 
respectively. 


in the isotope mass-balance model (Extended Data Fig. 3), this enabled 
us to turn our measured A?’0 distribution (Fig. 1) into bootstrap esti- 
mates for mid-Proterozoic GPP (Methods). The Monte Carlo estimates 
confirm that it is very likely that the mid-Proterozoic biosphere was 
less productive than the present biosphere, with >93% of the GPP esti- 
mates being less than pre-anthropogenic levels (Fig. 3). 

There is a small tail of relative GPP values above 100% PAL that 
require sulfate A!’O approaching zero, a I near the maximum prob- 
able value and—importantly—p,, close to 10% PAL (Extended Data 
Fig. 4). Although permitted by our analysis, these extreme circum- 
stances were unlikely to have been persistent on the mid-Proterozoic 
Earth, if they were present at all. Mid-Proterozoic GPP estimates are 
always strongly influenced by p, (Extended Data Fig. 4b), which sug- 
gests that the bimodal structure of the GPP probability density (Fig. 3) 
is a consequence of the wide range of Po, values that we used in the 
Monte Carlo calculations. Under more restricted ranges of Po» the 
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Fig. 3 | Empirical probability densities of mid-Proterozoic GPP. Monte 
Carlo solutions are based on the measured distribution of A!’O values of 
the approximately 1.4-billion-year-old Sibley sulfates, and take into 
account uncertainties in the control parameters of the isotope mass- 
balance model. The grey histogram represents the full suite of estimates 
and is well-approximated as the sum of a pair of Gaussian probability 
density functions (root mean squared error = 0.001; Methods). The red 
line represents the probability density function for GPP solutions when 
Po, levels are 1-10% PAL, which is characterized by a mean GPP value of 
41 +3% PAL (uncertainties are 95% confidence intervals on the mean). 
The orange line represents the probability density function for GPP 
solutions when Po, levels are 0.1-1% PAL, which is characterized by 

a mean GPP value of 6 + 1% PAL. 


bimodality disappears and the estimated probability densities for GPP 
take on a normal form (Extended Data Fig. 5, Methods). To explore 
this behaviour, we fit the full suite of GPP results to a pair of Gaussian 
probability density functions, one with a mean GPP of 6 + 1% PAL that 
is representative of low O2 conditions (0.1-1% PAL) and another with 
amean GPP of 41 +3% PAL (quoted uncertainties are 95% confidence 
intervals on the mean; Fig. 3) that is representative of high O2 condi- 
tions (1-10% PAL). The lower GPP state is consistent with lower mean 
Po, (about 0.3% PAL), and the higher GPP state corresponds to higher 
(about 2% PAL; Extended Data Fig. 5). The trajectory of these 


estimates reflects the positive covariation between GPP and Po, 


mean Po 


that scales across a range of GPP and pj, values (Extended Data 
Fig. 4b). 

Approximately 700-800 million years after the deposition of the 
Sibley sulfates, there was a major ecological shift in the marine realm 
that saw a cyanobacterial phytoplankton population replaced by one 
dominated by algae*’. This ecological transition has previously been 
linked to a rise in nutrient levels!”, consistent with eukaryotic predom- 
inance in more nutrient-rich regions of the modern oceans™. In light 
of the timing of this transition and the marked similarity of our higher 
GPP estimate to algal-dominated primary production in today’s ocean 
(about 45% of the primary production of the modern biosphere**), we 
suggest that our lower GPP estimate (and, as a result, a lower Po, ) is 
more likely to reflect the mid-Proterozoic world. Enhanced regenerated 
production in a bacteria-dominated, oligotrophic marine ecosystem***° 
could have supported higher overall GPP in nutrient-limited 
Proterozoic oceans!. However, there is evidence that intense hetero- 
trophic recycling of carbon appeared only in planktonic ecosystems in 
the latest Proterozoic*”?”8, providing further support for our lower 
GPP- “Po, estimate. Although reduced Proterozoic GPP is typically 
linked to the need to throttle O, generation on geologic timescales via 
diminished export production and organic carbon burial®'!”, the 
GPP-Pp,, covariation described here highlights the fact that diminished 
O, production may have also directly led to reduced O; levels on the 
Proterozoic Earth. Along with Archaean*® and Phanerozoic’ GPP 
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estimates that bookend our inferred Proterozoic GPP (about 6% PAL), 
this relationship suggests that primary production has progressively 
increased in concert with the broad two-step history of O2 in Earth’s 
atmosphere’. 


Online content 

Any Methods, including any statements of data availability and Nature Research 
reporting summaries, along with any additional references and Source Data files, 
are available in the online version of the paper at https://doi.org/10.1038/s41586- 
018-0349-y. 
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METHODS 


Sample information. Sample locations. Sediments of the Sibley Group were 
deposited over an approximately 70,000 km? area that extends in to northern 
Lake Superior, and to the margins of Lake Nipigon in Ontario, Canada (Extended 
Data Fig. 6). While the surface expression of the Sibley Group is typically poor, 
extensive mineral exploration in the area has provided kilometres of drill core. 
For this study, 68 samples were collected from four different drill cores: NI-92-7 
(UTM east 353850, north 5443000), NB-97-2 (UTM east 426990 north 5416241), 
NB-97-4 (UTM east 425430 north 5410540) and WP-07-03 (UTM east 379602 
north 5422047). 
Age constraints. Maximum age constraints for the entire Sibley Group are pro- 
vided by rhyolites that unconformably underlie the Sibley Group with a U-Pb 
date of 1,530.5+6.2 million years ago (Ma)**. A minimum age is constrained by 
cross-cutting diabase sills dated at 1,109.7 +2 Ma. The only age measured directly 
on Sibley Group material is provided by a whole-rock Rb-Sr isochron that gave a 
calculated minimum date of 1,339 + 33 Ma for the Kama Hill Formation shales*’. 
Palaeomagnetic pole positions place the Sibley Group’ on the apparent polar 
wander path of the Belt Supergroup at about 1.4 Ga”. In light of these constraints 
and the stratigraphic location of the Rossport Formation below the Kama Hill 
Formation (see below), we consider the approximate age of the material analysed 
for this study to be 1.4 Ga. 
Depositional setting. Gypsum nodules used in this study were taken from the 
Firehill Member of the Rossport Formation, which was deposited after the fluvial 
to near-shore lacustrine sandstones and conglomerates of the Pass Lake Formation 
and before the saline mud-flats of the Kama Hill Formation”®. The Rossport 
Formation is subdivided into three members beginning with the Channel Island 
Member at the base*’. The Channel Island Member is characterized by cyclic silt- 
stone-dolostone couplets with evaporite minerals preferentially being deposited 
in dolostone layers as nodules and bladed crystals*!. The Channel Island Member 
contains massive beds of sandstone near its top and is overlain by the Middlebrun 
Bay Member”. This 0.5-3-m thick, stromatolitic dolostone-chert represents the 
shoreline of the saline lake”*. The Middlebrun Bay Member is overlain by the 
Firehill Member, which consists of massive to finely laminated red siltstones with 
sporadic intraformational mass-flow conglomerate beds. Deposition occurred on 
mud- and sand-flats with a near-surface, saline water table?**!. Sulfate minerals 
throughout the Firehill Member have been observed to occur as nodules, bladed 
crystals, veins, cements and detrital grains; however, the sample set in this study 
consisted of only nodules. The Firehill Member is terminated by the appearance of 
ripple marks and hummocky cross-bedded sandstones and mudstones of the Kama 
Hill Formation, marking the transgression of a large water mass. Together these 
observations have shaped the view that the Pass Lake and Rossport Formations of 
the Sibley Group were deposited in predominantly fluvio-lacustrine and sabkha 
settings, although additional work is needed to constrain the influence of marine 
waters on the system. 
Isotopic measurements. A!”O measurements. We followed a previously detailed 
analytical procedure”, Drill core samples with abundant gypsum nodules were 
selected for oxygen isotope analyses (Extended Data Table 1). After a thin layer was 
milled from the outer surface to avoid potential modern contamination, samples 
were drilled to collect about 30 mg of material for pre-treatment. Drilled powders 
were first dissolved into a 0.1 M sodium hydroxide-0.05 M diethylenetriamine- 
pentaacetic acid (DTPA) solution to extract sulfate into solution and remove any 
non-sulfate oxygen-bearing species®’. The extracted sulfate samples were then 
reprecipitated at 80°C by acidifying with double-distilled 6 M hydrochloric acid 
followed by the addition of drops of concentrated barium chloride solution. This 
dissolution and reprecipitation was repeated to further eliminate possible con- 
taminations. Approximately 10 mg of purified BaSO, was then loaded onto a 316L 
stainless steel plate and placed under a bromine pentafluoride (BrF;) atmosphere 
for 12 h to eliminate any water absorbed by the samples. Molecular oxygen was 
generated from the samples using a CO>-laser fluorination system. Although 
approximately 25 j1mol of O2 gas was generated (25-35% yield) for each sample, 
this process is not observed to induce any significant isotopic fractionation for 
A170 values*?. Molecular O2 produced during the fluorination process was then 
taken through a number of cryo-focusing steps to remove condensable gases, 
followed by collection onto mol sieve 5A at —196°C before expansion into the 
sample bellows of a Thermo MAT 253 mass spectrometer for dual-inlet analysis. 
A'O values are expressed arithmetically as A170 =8'70 — 0.5305 x §8O 
(0.5305 represents a high temperature limit for O-!”0-180 fractionation***) 
and §O= ([‘Rsample/ ‘Ry.smow] — 1) x 1,000, in which 'R represents the ratio of 
‘O (i=17 or 18) to 160, and calculated values are reported as parts per thousand 
(%o) on the Vienna Standard Mean Ocean Water (V-SMOW) scale. We used an 
arithmetic definition for A!”O instead of a logarithmic definition—for example, 
A”O =1n(8!70 + 1) - 0.5305 x In(6!8O + 1)—to enable direct comparison 
between measured A!7O values and A!’0 values calculated from the isotope 
mass-balance model. Although a logarithmic definition for A!7O is commonly 


LETTER 


used owing to a weaker dependence on the isotopic composition of reference mate- 
rial6”°8, differences between arithmetic and logarithmic A!O values calculated 
in this study were typically in the order of 0.001%. This deviation is much less 
than the uncertainty on A!’O analyses (10 =0.05%o), estimated based on multi- 
ple measurements of the same BaSO, calibrated against UWG-2 (taken to have 
880 = 5.80%bo and §!’0 = 3.016%o on the V-SMOW scale; Extended Data Table 1, 
Extended Data Fig. 1). 

5'8O measurements. Because the low yield of laser-fluorination techniques leads 
to a significant fractionation in the measured 880 value of sulfates, we com- 
busted sulfate samples and measured the major oxygen isotope composition 
on the resulting carbon monoxide (CO)“4. Measurements for 6!8O values were 
made on the same aliquots of sample used for A!’O analysis that underwent the 
DTPA reprecipitation treatment. Analyses were performed using a Temperature 
Conversion Elemental Analyzer connected to a Conflo-III and a Thermo MAT 253 
mass spectrometer in continuous-flow mode. The estimated total analytical error 
for 5!8O analyses is estimated to be less than 0.5%o from repeated measurements 
of laboratory standards (USGS34 &!8O = —27.80%0; IAEA-NO3 6180 = 25.6%o; 
USGS35 6°O = 56.80%0). 

54S and A*¥S measurements. Sulfur isotope measurements were made using puri- 
fied BaSO, produced from the DTPA re-precipitation treatment for oxygen isotope 
analysis. Approximately 10 mg of barite powder was reacted with 15 ml of Thode 
reduction solution at 100°C for at least 2 h°*. Powders reacted completely to pro- 
duce H,S that was carried through a condenser in a N; gas stream and bubbled 
into a 0.4 M zinc acetate solution in which the H2S was quantitatively captured 
as ZnS. Samples were then reacted with ~5-10 drops of 0.2 M AgNO; solution 
to convert ZnS to Ag»S. The Ag»S was then filtered, collected and dried for 12 h. 
Approximately 3 mg aliquots of dried Ag,S were then loaded into nickel bombs 
and heated to 250°C for 12 h under a fluorine gas atmosphere to generate SF gas 
for analysis. The resulting SF, was purified cryogenically and chromatographically 
before isotopic analysis on a Thermo MAT 253 mass spectrometer in dual-inlet 
mode. 

Sulfur isotope compositions are expressed as 8'S = ([‘Rsampte/‘Rv-cpt] -1) 
x 1,000, in which ‘R represents the ratio of ‘S (i=33 or 34) to 7S, and V-CDT 
indicates the Vienna Canon Diablo Troilite scale. A**S values are expressed as 
A®sS = §S - 1000 x ([1-++ (8*4S/1,000)]°°!> — 1). 

Results were normalized against repeat analyses of international reference mate- 
rial LAEA-S1, which we take to have §*4S = —0.3%o and A**S = —0.061%o on the 
V-CDT scale. The estimated (1c) total analytical uncertainty on the entire proce- 
dure is estimated to be better than 0.1%o for §*4S and 0.01%o for AS (Extended 
Data Table 1). 

Model description. We adapted an analytical solution to a steady-state model’ for 
the A’”O value of tropospheric O, that allows for variation in Po, Peo, strato- 
sphere-troposphere transport and stratospheric O,-O3-CO, photochemistry 
(Extended Data Table 2). As described below, we made three primary changes to 
this model. First, we cast the model solution into a form that incorporated p, , 
P-o, and GPP estimates relative to a set of reference values for PAL (Extended 
Data Table 2). Second, we rearranged the model solution to solve GPP as a function 
of measured A?’0 values and model parameters. Third, we inverted the model 
solution to solve p, as a function of measured A 170 values, other model param- 
eters and inferred GPP. We present the equations used to enact these changes in 
the Supplementary Information and describe the constraints that we applied to the 
various model parameters below. Finally, we outline our Monte Carlo approach 
for estimating probability densities for GPP and Po, 

Constraints on model parameters. Poo, One-dimensional radiative convective 
models have been used to estimate p._ levels required to maintain Earth surface 
temperatures of 273 K and 288 K, placing broad upper and lower limits on p,._ at 
1.4 Ga of 1 and 100 PAL, respectively! (Extended Data Fig. 2). A further account- 
ing of transport process through extrapolation of results from the CAM3 general 
circulation model refines these values considerably®. For example, to maintain an 
average surface temperature of 288 K under a pure CO) atmosphere an upper 
estimate of p._ at 1.4 Ga is calculated to be 30 PAL, and an atmosphere including 
1 PAL N2 and 10~‘ bar of methane provides a lower bound at 5 PAL (Extended 
Data Fig. 2). These ranges are consistent with results from the COPSE Earth System 
model that put forward a range of 8-20 PAL® for Poo, at 1-4 Ga (Extended Data 
Fig. 2). : 

Geochemical approaches have also been used to constrain Proterozoic 
CO, levels. A previous study attempted to constrain Poo, at 1-4. Ga by relating 
carbon isotope fractionations between organic matter in microfossils and 
surrounding carbonate to extracellular CO, levels™. Such estimates remain poorly 
calibrated in the laboratory, especially at high CO, levels, so we do not include the 
suggested 10-200 PAL range put forward through this approach. Quantification 
of silicate weathering in response to different levels of p.. in profiles of 1.8- 
billion-year-old ( Poo, = PAL) and 1.1-billion-year-old (p,, =1PAL) palaeosols 
suggests Pug, levels were about 2-20 PAL when extrapolated to 1.4 Ga®, A lower bound 
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of 2 PAL approximates the CO, threshold (350-550 p.p.m.) thought to maintain 
an ice-sheet-free Palaeogene Earth® (Extended Data Fig. 2). Distilling results from 
these different approaches and considering the robustness of the different results, 
we set a uniform uncertainty envelope around CO, levels at 1.4 Ga of 2-30 PAL 
(Extended Data Fig. 2). 

Po, The presence of A'70 anomalies requires a threshold amount of atmospheric 
oxygen to allow for the inception of photochemical reactions involving ozone. 
Calculations suggest this threshold exists ata p, value of 0.001 PAL!**” (Extended 
Data Fig. 2), well above the levels calculated for the removal of mass-independent 
sulfur isotope anomalies®. It is for this reason that Archaean samples (Extended 
Data Table 1) do not bear anomalous A!’O signatures”’ even though Archaean 
Poo, concentrations were probably at much higher levels than the modern 
environment. 

Studies based on mineral stability observed through iron loss within palaeosols 
estimated that mid-Proterozoic O, levels were greater than 0.01 PAL”. Recent work 
has examined the kinetics involved in oxidizing terrestrial Mn or Fe and tracked 
this through the isotopic composition of chromium (Cr) through the mid- 
Proterozoic sedimentary record. These Cr-isotope-based estimates provide a 
threshold estimate of mid-Proterozoic pat <0.01-0.001 PAL, with evidence for 
Po, levels above this only appearing after 1.2 - 0.8 Ga’!~73 (Extended Data 
Fig. 2). Palaeosol and Cr-isotope-based estimates contrast with those suggested 
through trace metal enrichments and biomarkers in 1.4-billion-year-old shales 
that have been used to argue for p,, levels >0.04 PAL’*75, These estimates are 
consistent with a broad range of values implied by tracking Zn/Fe ratios of car- 
bonates over the past 3.5 billion years of Earth history that place Po, at less than 
0.06 PAL over the mid-Proterozoic”®. 

Modelling studies have added to this debate by suggesting that mid-Proterozoic 
Po, must be much less than 0.4 PAL” to be consistent with no evidence for per- 
sistent fully oxygenated oceans in preserved marine sediments”®*”? or 0.1 PAL to 
be consistent with the stability of the stable carbon isotope record®’. Biological 
modelling constraints have been provided by calculating O2 requirements for 
Grypiania spiralis, and were suggested to be between 0.01-0.1 PAL, placing a lower 
limit on Po, upon the emergence of these fossils in the geologic record by 1.4 Ga®! 
However, estimates using this logic have been brought down considerably in recent 
years to about 0.0015 PAL from calculations based on minimum requirements of 
simple bilaterians and about 0.004 PAL from experimental results on sponges 
grown in the laboratory®*”*? (Extended Data Fig. 2). 

All of these studies speak to reduced Po, in the mid-Proterozoic compared to 
more recent Earth history (Extended Data Fig. 2), but to date there is no clear 
consensus. With this in mind, we remain largely agnostic to the techniques used 
to derive previous estimates and apply a uniform distribution between a lower limit 
of 0.001 PAL (0.1% PAL) and an upper limit of 0.1 PAL (10% PAL) as our initial 
estimate for mid-Proterozoic Po, (Extended Data Table 2, Extended Data Fig. 3). 
fo2. Two laboratory studies*”** Rave attempted to quantify different pathways of 
pyrite oxidation and the proportion of atmospheric O) that is incorporated into 
product sulfate (f : In experiments with Acidithiobacillus ferrooxidans it was 
determined that between 8 and 15% of oxygen in product sulfate was from atmos- 
pheric oxygen”. In abiotic experiments it was determined that 13% of oxygen in 
product sulfate was from atmospheric oxygen”. Both experiments were conducted 
at low pH values of 2.2-3”. In a second study, abiotic experiments were conducted 
over a much broader pH range (2-11) and O incorporation into sulfate during 
pyrite oxidation was determined using both major (6!8O) and minor (A!”0) oxy- 
gen isotopes*4. In these experiments it was determined that 21-34% of oxygen in 
sulfate was sourced from atmospheric oxygen*. Although it remains difficult to 
determine the exact proportion of oxygen within sulfate that is sourced from H,O 
and Oy, these previous studies provide a conservative range of f. of 8-34%. When 
considering the differences in kinetics between abiotic and biotic pyrite oxidation 
in modern settings, however, this range can be refined. Experiments and natural 
observations have shown that biological mediation can markedly increase pyrite 
oxidation kinetics and therefore is more likely to dominate natural surface envi- 
ronments both at present and probably in the past”!**. Therefore, in this study we 
assume a uniform distribution of possible des values between 8 and 15% (Extended 
Data Table 2, Extended Data Fig. 3). 

Oxygen isotope fractionation in O2-O3-COz photochemical reaction networks. We 

follow a previously published approach? and use previously published results of 
photochemical experiments” to constrain the parameters: /, X), Xs "mes, /0, 

and !7p eit 2/0," Published fits to the experimental results for 8°06; jo, #84 func: 
tion of Po, Wea ratios constrain fp to 1.230 +0.325, X; to 64.045. 3%o and X;, 
to 146+ + 2% 050 (all uncertainties 1a; Extended Data Table 2). We used these values 
and their uncertainties to define Gaussian probability density functions for po, Xi 
and X;, (Extended Data Fig. 3). 

We made a linear least-squares fit to the§8Osst 5/0, and sont »/0, measurements 
from a previously published study® (Extended Data Table 1) and used 
this fit to derive a linear relationship between §'’08%' jo, and §8oust Jo, The 

2/2 2/2 


coefficients of the resulting relationship are: 7 méeos 0 /0, = 9-5167 + 0.0336 and 
708 oo. = —8.052 + 3.673 (all uncertainties lo; Extendéd Data Table 2). We used 
these values and their uncertainties to define Gaussian probability density func- 
tions for! "mes, o, and DEG" 0, (Extended Data Fig. 3). The coefficients derived 
here differ gO com the previously published approach? owing to the fact that 
we define A!”O values with a slope of 0.5305°**° rather than 0.52. 

Gamma (y). We use an estimate of the mass of the modern atmosphere 
(5.148 x 10!8 kg)** and a recent determination of the stratosphere-troposphere 
mass flux (2.194 x 10!” kg yr~1)8” to derive an updated estimate for the strato- 
sphere-troposphere exchange rate in the modern atmosphere (0.0426 yr}; 
Extended Data Table 2). It is not well-known how stratosphere—-troposphere mass 
transfer will change under atmospheric states different from the modern state. 
Previous work has explored how the Brewer-Dobson circulation may change with 
elevated CO; levels, and suggests that increasing p.,_ by a factor of two will 
increase mass flux across the tropopause by about 20% with further CO; increases 
only slightly affecting stratosphere-troposphere exchange®*. As we lack a full quan- 
titative exploration of the effect of mid-Proterozoic atmospheric state on y, we take 
a statistical approach here and allow 7 to vary by about 50% according to a Gaussian 
probability function defined by a 1o value of 0.01 (Extended Data Table 2, 
Extended Data Fig. 3). 

Theta (0). We use the calculation procedure described above to define a value for 
the fraction of stratospheric O taking part in the stratospheric O.-O3-CO, reac- 
tion network under pre-anthropogenic levels (p, =1 PAL; Pp, CO, =1 PAL; 
GPP =1 PAL). Within this calculation framework, a valuie of 6 of 0.1156 feproduces 
the A!”O of modern tropospheric O2 (—0.546%o under the definition of A!’O used 
here)”>. Although it is unknown how this value will change under different atmos- 
pheric states, we assume that 6 is fixed by the development of an ozone layer and 
hold it constant throughout the calculations presented here. 

Model calculations and sensitivity. We estimated probability densities for 
mid-Proterozoic GPP by repeating the calculation procedure described above 
10,000 times, each time drawing a single estimate for each model parameter 
according to the distributions shown in Extended Data Fig. 3 and a single estimate 
for the A!”O of mid-Proterozoic sulfate randomly from the dataset measured here 
(Fig. 1, Extended Data Table 1, Extended Data Fig. 7). A realization of the GPP 
probability density from this Monte Carlo resampling and calculation procedure 
is shown in Fig. 3 and archived in Supplementary Table 1. We also plot individual 
GPP results against various control parameters of the model in Extended Data 
Fig. 4. Although we permitted potential variability in A!”O due to local processes 
(for example, changes in sulfide oxidation through variable f, » that may have 
varied stratigraphically (Extended Data Fig. 7), as well as global processes (for 
example, changes to stratospheric oxygen dynamics through a variable 
stratosphere-troposphere exchange rate), these plots show that GPP is most 
sensitive to variations in p,, , with a near log-linear relationship between assumed 
Po, and calculated GPP (Extended Data Fig. 4b). Much weaker dependencies on f, 
and A?”Oguifate are apparent. Although a small percentage of the calculated GPP 
values are greater than 1 PAL, the sensitivity exercise illustrates that these values 
require a combination of high Po. small A!’Ogutfate and large Tes, (Extended Data 
Fig. 4). This combination of extreme conditions is unlikely to have characterized 
the mid-Proterozoic Earth, given the probability distributions derived here 
(Extended Data Fig. 4). 

The strong control of p ‘0, 00 the calculated GPP values led us to investigate more 
restricted ranges of Pp, between 0.1% and 1% PAL and between 1% and 10% PAL. 
In both cases the bimodality seen in the full suite of Monte Carlo calculations (Fig. 3) 
disappears, and both sets of calculations are well-approximated by single-peaked 
Gaussian probability density functions (Extended Data Fig. 5). This confirms that 
Po, is the dominant control on the bimodal structure seen in the full suite of Monte 
Cailo calculations (Fig. 3), and justifies our division of those results into a pair of 
Gaussian probability density functions, one associated with p, between 0.1% and 
1% PAL and another associated with p,, between 1% and 10% PAL 

We fit the estimated GPP probability density to combinations of Gaussian prob- 
ability density functions and found that a pair of such functions well-approximated 
the calculated results (Fig. 3). The parameters describing these two Gaussian prob- 
ability density functions can be found in Supplementary Table 1 and indicate that 
our measurements of the A'7O of mid-Proterozoic sulfate are consistent with two 
mean values of GPP (41 +3% PAL and 6+ 1% PAL, in which the uncertainty 
reflects 95% confidence intervals on the mean). As described above, we inverted 
our calculation procedure to estimate the specific Po, values that are consistent 
with each of these GPP estimates (Extended Data Fig. 5) and found that the high 
GPP state corresponds to p._ of about 2% PAL, and the low GPP state corresponds 
to p,_ of about 0.3% PAL (Supplementary Table 1). This direct relationship between 
GPP and Po, is present over a wide range of GPP values (Extended Data Fig. 4b) 
and appears to reflect a fundamental characteristic of the mid-Proterozoic Earth. 
Reporting summary. Further information on experimental design is available in 
the Nature Research Reporting Summary linked to this paper. 


© 2018 Springer Nature Limited. All rights reserved. 


Data availability. The authors declare that all data supporting the findings of this 
study are available within the paper and its Supplementary Information. 
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Extended Data Fig. 1 | Histograms of existing data for A’’O values 
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Extended Data Fig. 2 | Compiled p___ and Po, estimates. a, p, 
estimates. Left y axis, percentage of PAL; right ¥ axis, p.p.m. Grey band 
outlines results from 1D modelling®! based on temperatures of 273 K 
(bottom), 288 K (top) and changing solar luminosity. Red dotted lines 
represent extrapolated general circulation model (GCM) results from 
Archaean estimates. The green-shaded region represents the uncertainty 
envelope of palaeosol-based estimates® with the green dotted lines 
interpolating between estimates at 1.8 and 1.1 Ga together. The pink- 
shaded region represents estimates based on the COPSE Earth system 
model®. The brown bar represents modelling-based estimates required to 
prevent a global glaciation at 1.1 Ga®®. The dark-blue square is the 


microfossil-based estimate that sets maximum limits at 1.05 Ga””?!. Yellow 
arrows represent the upper (30 PAL) and lower (2 PAL) limits used in this 
work, Data are from previous publications®!-©> 8", b, Po, estimates. 
Green arrows represent biologically based estimates; blue afrows represent 
geochemical estimates; and in red are modelling p, estimates. Purple 
lines represent the removal of S-MIF”, a proposed bistability field®’, and 
constraints on the establishment of a modern-like ozone layer'?*”. The 
yellow dashed line represents the suggested limits for the removal of deep 
ocean anoxia’®, and the grey dashed line represents the appearance of 
charcoal. Data are from previous publications?:19°71°870-72,74:76.77:80-83,93,94, 
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Extended Data Fig. 4 | Realization of about 10,000 Monte Carlo 
calculations of GPP (PAL) relative to various control parameters in the 
isotope mass-balance model. a, Unlike model calculations for the 
Neoproterozoic era, there is no clear strong dependence of GPP on 
assumed p_. . b, There is a clear log-linear dependence of the GPP 
estimates on Po, c, GPP responds weakly to to, with large fractions of O2 


in sulfate (which indicate smaller A'7O 2 values) leading to higher 
estimates of GPP. d, The response of GPP to A!’Oguifate is similar to the 
response to f, . Smaller A’’O,ujfate Values indicate smaller A'7O) values, 
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Extended Data Fig. 5 | Realization of about 20,000 Monte Carlo 
calculations of GPP (PAL) and Po ® Results are calculated to be 
consistent with the A!”O measurements in Extended Data Table 1 and the 
probability distribution functions shown in Extended Data Fig. 2, with the 
exception of p, . These calculations assume restricted ranges of p, 
between 0.1% afd 1% PAL (orange histogram) and between 1% and 10% 
PAL (blue histogram). In both cases the bimodality seen in the full suite of 
Monte Carlo calculations (Fig. 3) disappears, and both sets of calculations 
are well-approximated by single-peaked Gaussian probability density 
functions. This confirms that Po, is the dominant control on the bimodal 
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structure seen in the full suite of Monte Carlo calculations (Fig. 3), and 
justifies our division of those results into a pair of Gaussian probability 
density functions, one associated with Pp, between 0.1% and 1% PAL and 
another associated with p, between 1% dnd 10% PAL. b, Results are 
solutions that are consistent with the A!”Ogutfate dataset (Extended Data 
Table 1), the probability density functions shown in Extended Data Fig. 1 
(with the exception of p, ) and the mean GPP estimates of the two 
inferred Gaussian probability distributions for mid-Proterozoic GPP 
(Fig. 3). GPP was allowed to vary in a Gaussian fashion between the 95% 
confidence limits on the GPP mean values. 
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Extended Data Fig. 6 | Geological map of the Lake Nipigon-northern Lake Superior region. This figure was adapted from a previous publication”*. 
© 2008 Canadian Science Publishing or its licensors. Reproduced with permission. 
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Extended Data Fig. 7 | Oxygen and sulfur isotope compositions (A170, 8180, 5*4S and A*°S) for sulfates from drill hole NI-92-7 plotted against 
stratigraphic height. Uncertainty on all analyses is smaller than the sizes of the data points. 
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Extended Data Table 1 | Isotopic data and comparisons with previously published results 


A 
PF-5 z i NI-92-7-15.4 
PF-12 . . NI-92-7-15.95 -0.62 8.1 10.5 -0.029 
PF-2 : i NI-92-7-26.8 -0.61 9.2 9.9 -0.018 
PF-20 ! . NI-92-7-32.58 -0.66 8.6 10.4 -0.017 
PF-11 5 . NI-92-7-35.75 -0.58 8.3 
PF-21 . NI-92-7-51.45 -0.61 10.0 8.3 -0.033 
PF-4 z i NI-92-7-52.3 -0.60 6.9 
PF-10 i NI-92-7-54.25 -0.53 8.3 9.9 -0.044 
PE-7 ; E NI-92-7-57.3 -0.60 8.4 9:9 -0.048 
PF-8 : i NI-92-7-78.3 -0.55 72 9.6 -0.054 
PF-9 5 . NI-92-7-88.5 -0.58 9.4 11.8 -0.062 
PF-9 re-run NI-92-7-126.75 -0.56 10.2 OF -0.048 
PF-6 . H . NI-92-7-158.7 -0.77 77 5.3 -0.046 
PF-6 re-run NI-92-7-164.7 
PF-15 fi A WP-07-03-1 -0.69 6.5 
PF-13 . WP-07-03-2 -0.84 6.9 5.6 -0.045 
PF-13 re-run WP-07-03-3 -0.58 ‘etal 
PF-1 i WP-07-03-4 -0.49 8.0 
PFI7 fl i WP-07-03-5 -0.79 73 
PF-3 . WP-07-03-8 -0.70 6.6 
PF-3 re-run WP-07-03-10 -0.76 8.5 
PF-16 i H WP-07-03-10 -0.88 6.8 5.9 -0.220 
PF-18 : : . WP-07-03-11 -0.71 6.8 
PF-19 i WP-07-03-12 -0.82 6.6 
PF-18- K WP-07-03-12 -0.72 6.6 
03RM26 y i WP-07-03-13 -0.83 7.0 
03RM27 E . WP-07-03-15 -0.85 7.0 
03Rm62 . i WP-07-03-18 -0.84 8.1 
04Rm9 . ; . WP-07-03-19 -0.87 8.4 47 -0.41 
04RM30 z E . WP-07-03-20 -0.73 8.4 
04RM31 5 ! i WP-07-03-21 -0.85 8.9 6.4 -0.24 
NI-92-7-2.35 : . WP-07-03-23 -0.70 8.8 
NI-92-7-3.7 b . WP-07-03-216.6 -0.88 Ted 
NI-92-7-6.2 . H WP-07-03-217.6 -0.83 6.5 
NI-92-7-8.72 

B 


Marinoan Barite Marinoan CAS 


<0.0001 
0.96 


Significant? No No Yes 
C 

Samples This Study Phanerozoic Cryogenian Barites Cryogenian CAS 

Evaporites 

Total number of values 68 51 213 25 

Minimum -0.88 -0.34 -1.05 -1.64 

25% Percentile -0.76 -0.18 -0.57 -0.63 

Median -0.66 -0.12 -0.42 -0.36 

75% Percentile -0.58 -0.06 -0.26 -0.12 

Maximum -0.35 0.00 -0.02 -0.04 

Mean -0.67 -0.13 -0.44 -0.51 

Std. Dev. 0.12 0.08 0.23 0.48 

Std. Err.Mean 0.01 0.01 0.02 0.10 

-95% CI Mean -0.71 -0.16 -0.47 -0.71 

+95% CI Mean -0.64 -0.11 -0.40 -0.31 
D 

Sample avo Description 

Dresser Mine -0.076 23.5 Ga barite samples from North Pole Dome, West Australia” 

PCNP-1 -0.039 

PCNP-2 -0.069 

PCNP-4 0.083 

Mean -0.025 95% E:ror on Mean = 0.080 


a, Data for Sibley samples measured in this study. Associated errors on total laboratory procedures and analysis are presented in the Methods. b, Results of a linear regression analysis for A!7O— 
8!80 (n= 64), 5180-8345 (n = 46), A170-834S (n= 46) and A33S-84S (n = 46) data from this study, along with results from previously published syn-Marinoan CAS29, and post-Marinoan barite23942. 
Correlations (Pearson's) are significant if P < 0.05. ¢, Summary statistics of A!’0 results on samples from this study compared to post-Marinan barites?3?*?, syn-Marinoan CAS?9 and Phanerozoic 
evaporites*. Samples were binned in 0.1% increments. d, New A!”0 measurements from Archaean barites. Data are from a previous publication®. 
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Extended Data Table 2 | Summary of reference parameters for model calculations 


Parameter Definition Value Ref. 
pO2 at 1 Pre-Anthropogenic Level (PAL) 1 PAL = 209500 ppmv 

PCO, ref pCO, at 1 Pre-Anthropogenic Level (PAL) 1 PAL = 280 ppmv 

Pref Reference pO2-pCO? ratio at 1 Pre- 748.21 see 
Anthropogenic Level (PAL) text 

Via Pre-anthropogenic O2 residence time 1244 yr 18 

AO modem 05 A'O of modern Q) using arithmetic definition -0,546%o 25 
for A'’O and slope of 0.5305 

fo, Fraction of O2 incorporation during sulfide 0.08-0.15 22, 
oxidation 

y Rate constant for stratosphere-troposphere 0.0426 yr see 
exchange text 

6 Fraction of O2 in stratospheric O2-O3-CO2 0.1156 see 
cycles text 

Po pO2-pCO? ratio in O2-O3-CO2 isotope 1.23 60 
fractionation experiments 

Xx: 51° Oco,-0, Value in O2-O3-CO; isotope 64%o 60 
fractionation experiments at low pO2 

Xp 51° Oc0,-0, Value in O2-O3-CO; isotope 146%o 60 
fractionation experiments at high pO 

TM E0;-O> Slope of fit to 8'®O¢,.0,and At”0¢o,-0, using 0.5167 3,60, 
arithmetic definition for A!’O and slope of see 
0.5305 text 

EO s-05 Intercept of fit to &'®0¢9,.0,and At” O¢0,.0, -8.025%o 3,60, 
using arithmetic definition for A!’O and slope of see 
0.5305 text 


Data are from previous publications?18.22:25.60, 
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The outer membrane is an essential load-bearing 
element in Gram-negative bacteria 


Enrique R. Rojas!*, Gabriel Billings*, Pascal D. Odermatt!, George K. Auer®, Lillian Zhu!, Amanda Miguel!, Fred Chang?, 
Douglas B. Weibel®”®, Julie A. Theriot?! & Kerwyn Casey Huang)?! 


Gram-negative bacteria possess a complex cell envelope that 
consists of a plasma membrane, a peptidoglycan cell wall and an 
outer membrane. The envelope is a selective chemical barrier! 
that defines cell shape” and allows the cell to sustain large 
mechanical loads such as turgor pressure’. It is widely believed 
that the covalently cross-linked cell wall underpins the mechanical 
properties of the envelope**. Here we show that the stiffness 
and strength of Escherichia coli cells are largely due to the outer 
membrane. Compromising the outer membrane, either chemically 
or genetically, greatly increased deformation of the cell envelope 
in response to stretching, bending and indentation forces, and 
induced increased levels of cell lysis upon mechanical perturbation 
and during L-form proliferation. Both lipopolysaccharides and 
proteins contributed to the stiffness of the outer membrane. These 
findings overturn the prevailing dogma that the cell wall is the 
dominant mechanical element within Gram-negative bacteria, 
instead demonstrating that the outer membrane can be stiffer than 
the cell wall, and that mechanical loads are often balanced between 
these structures. 

The three essential layers of the Gram-negative cell envelope (Fig. 1a) 
are chemically and structurally diverse: the plasma membrane is a fluid 
phospholipid bilayer, the peptidoglycan cell wall is a covalently cross- 
linked macromolecule, and the outer membrane possesses phospho- 
lipids in the inner leaflet and lipopolysaccharides (LPS) in the outer 
leaflet. A primary role of the envelope is to sustain mechanical forces’, 
and it is universally assumed that the mechanical integrity of the enve- 
lope is conferred by the cell wall*°. However, the unique chemistry of 
the outer membrane leads to notable physical properties. For example, 
although proteins freely diffuse in the plaama membrane, the motion 
of outer-membrane proteins is constrained*®. In this light, we inves- 
tigated whether the outer membrane contributed to the mechanics of 
the cell envelope. 

To assay the mechanical properties of the E. coli envelope, we first 
measured its contraction when turgor pressure (approximately 1 atm*”) 
was eliminated by subjecting cells to a large hyperosmotic shock"°. This 
shock induced plasmolysis'! whereby the inner membrane receded 
from the cell wall (Fig. 1b, Supplementary Video 1), indicating that 
the complex formed by the cell wall and the outer membrane had con- 
tracted to its relaxed state (Extended Data Fig. 1). Plasmolysis caused 
the length of the cell wall to contract by ¢,=9.6 + 2.9% (ep=(h — h)/b, 
in which 1; is the pre-shock length and I, is the post-shock length; 
mean +s.d.; Fig. 1c, d). To test whether the outer membrane affected 
this response, we damaged it by exposing plasmolysed cells to a 
detergent (5% N-lauroyl sarcosine; Fig. 1b, Extended Data Fig. 2a—d, 
Supplementary Videos 1, 2). Cells lysed within minutes of detergent 
treatment (Fig. 1b, Supplementary Video 1), coinciding with a brief 
phase of rapid cell-wall expansion (Fig. 1c) that was probably caused 
by viscous drag on the cell wall by the cytoplasm as it was released 


from the cells (Supplementary Information, Extended Data Fig. 2e, f). 
Cell-wall length then contracted beyond its post-plasmolysis value by 
€17= 14.54 8.3% (€)= (hh — low)/ew in which I, is the cell-wall length 
after detergent treatment; Fig. le, Supplementary Video 1). Thus, 
although plasmolysis caused the cell wall to contract, compromising 
the outer membrane caused a second, larger, contraction (P= 107°, 
Student’s two-sided t-test) that was not due to changes in cell-wall 
composition (Extended Data Fig. 2g) or compaction by the detergent 
(Supplementary Information, Extended Data Fig. 3a, b). We observed 
similar behaviour under other growth conditions, in other E. coli strains 
and in other Gram-negative species, but not in the Gram-positive 
bacterium Bacillus subtilis (Supplementary Information, Extended Data 
Fig. 3b—f). 

Under turgid conditions, the cell wall is under extreme extension: 
between the turgid state and the fully relaxed state, the cell-wall length 
contracted by a total of ¢;=25.0 + 8.6% (€;=( — low)/lews n= 56), with 
increased contraction at higher detergent concentrations (Extended 
Data Fig. 4a). In addition, total contraction was correlated with the 
residual phase density of the cell after lysis (Fig. 1b, arrow), which was 
caused by retention of specific proteins within the sacculus (Extended 
Data Fig. 4b-h, Supplementary Information). The least phase-dense 
cells contracted by as much as 50% (Fig. 1f, Extended Data Fig. 4h). 
These data suggest that after lysis, residual cytoplasm within the 
envelope caused an entropic, turgor-like pressure within cells, indicating 
that our measurement of the contraction upon lysis was actually 
lower than it would have been if all cytoplasmic contents were lost. By 
comparison to the cell wall, the relative length extension at which 
typical materials plastically deform ranges from approximately 0.01% 
to approximately 5% for pure elements!’ and is approximately 10% 
for agarose gels!°. 

Our results suggested that the outer membrane was stabilizing the 
cell wall in a highly stretched state during plasmolysis by bearing 
compressive stress, thereby balancing tensile stress in the wall (Fig. Ic, 
right). This model implies that the relaxed size of the outer membrane 
is larger than that of the cell wall (and larger than the size of the cell 
envelope after plasmolysis) and that the outer membrane can bear 
mechanical forces comparable to those borne by the wall. To estimate 
the rest length of the outer membrane, we plasmolysed cells and then 
digested their cell walls with lysozyme, thereby allowing their outer 
membranes to relax (Fig. 2a). As cell-wall digestion caused the cells to 
form spheroplasts (Fig. 2a), we measured the surface area of the outer 
membrane and calculated the length that it would have had in a rod- 
like shape, given this surface area. We found that the rest length of the 
outer membrane (/,,) was precisely equal to the length of the turgid 
cell (lom/f = 1.00.11 for E. coli MG1655, n= 12, Fig. 2b, with similar 
values for other wild-type strains, Extended Data Fig. 3g), and was 
therefore larger than the length after plasmolysis. Thus, during steady- 
state growth, the outer membrane bears little or no load. Using this 
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Fig. 1 | Detergent treatment after plasmolysis causes further contraction 
of the Gram-negative cell wall. a, Model of the cell wall-outer membrane 
complex as parallel linear springs with spring constants kw, kom, and rest 
lengths |, Jom. b, E. coli cells (turgid, plasmolysed (Plasm.), lysed) stained 
with WGA-488 and FM 4-64. The white arrow indicates residual phase 
signal after lysis. Images are representative of 84 cells from 3 experiments. 
Outer mem., outer membrane. c, Left, cell-wall length versus time during 
hyperosmotic shock and treatment with detergent. Data are representative 
of 84 cells. The red arrow indicates sharp swelling upon lysis. Right, a 
model of the turgid, plasmolysed and lysed cellular states. d-f, Histograms 
of length contraction upon plasmolysis (d, n =79 cells), lysis (e, n =56 
cells) and in total (f, n = 56 cells). Circle and error bars indicate the 

mean + s.d. 


finding, we solved for the stiffness of the outer membrane by treating 
the cell wall-outer membrane complex as parallel linear springs 


Ey 


k om = ———k 
om e(4 +) cw (1) 


from which it follows that the outer membrane is stiffer than the cell 
wall, with kom = 1.32kgy. 

We thus sought to dissect the molecular basis underlying the 
load-bearing capacity of the outer membrane by altering its compo- 
sition via chemical and genetic means. Divalent cations mediate ionic 
bonds between negatively charged moieties within LPS, including 
charges in the lipid A and the core oligosaccharide domains". Notably, 
we found that Vibrio cholerae O1, which has one phosphate group 
within lipid A substituted", has a relatively compliant outer membrane 
(Extended Data Fig. 3f). Accordingly, we hypothesized that ionic bonds 
were important for the load-bearing capacity of the outer membrane 
and that disrupting these bonds would decrease its stiffness. Treating 
cells with EDTA, which chelates magnesium, released LPS from the 
outer membrane!®!” (Extended Data Fig. 2d), and had no direct 
effect on cell-wall length (Extended Data Fig. 5a—c) or composition 
(Extended Data Fig. 2g). We therefore predicted that EDTA treatment 
after plasmolysis would cause a cell-wall contraction similar to deter- 
gent treatment. Indeed, the walls of plasmolysed cells began to contract 
shortly after EDTA was added (Fig. 2c) and the degree of contraction 
increased with EDTA concentration (Extended Data Fig. 5d). Adding 
saturating magnesium blocked this effect (Extended Data Fig. 5e). 
When we repeated the plasmolysis and lysis experimental sequence 
(Fig. 1c) with the addition of 10 mM EDTA in both the plasmolysis 
and lysis media, cell walls contracted: i) much more upon plasmolysis 
(Fig. 2d); ii) much less upon detergent treatment (Fig. 2e) and iii) the 
same total amount as untreated cells (Fig. 2f), reducing the estimated 
stiffness of the outer membrane by 70% (Fig. 2g). 

The dye FM 4-64, which intercalates into the outer membrane, did 
not have a significant effect on outer-membrane stiffness (Fig. 2d-g), 
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whereas genetic alterations to the outer membrane had marked effects. 
A mutant E. coli strain with the imp4213 allele of IptD, encoding a 
component of the LPS assembly machinery, has a porous outer mem- 
brane rich in phospholipids!*!. As with EDTA-treated cells, imp4213 
cells contracted more upon plasmolysis (Fig. 2d) and less upon lysis 
(Fig. 2e), with no change in total contraction (Fig. 2f), indicating 
reduced outer-membrane stiffness (Fig. 2g). A mutant lacking the 
abundant outer-membrane protein OmpA exhibited a similar pheno- 
type (Fig. 2d-g), demonstrating that proteins also contribute to 
outer-membrane mechanics. 

The LPS of most Gram-negative bacteria include a polysaccharide 
component called the O antigen (although many domesticated strains 
do not). Notably, adding the O8 antigen to a domesticated E. coli 
strain (AB1133) that exhibited no contraction upon lysis (Extended 
Data Fig. 3h) greatly reduced its contraction upon plasmolysis (Fig. 2d, 
Extended Data Fig. 3h) and increased its contraction upon lysis (Fig. 2e, 
Extended Data Fig. 3h), suggesting that the antigen greatly stiffens the 
outer membrane (Fig. 2g). The O8 antigen is an electrically neutral 
poly-mannose molecule, demonstrating that non-ionic interactions 
between LPS molecules can also contribute to outer-membrane 
stiffness. 

For the outer membrane to exert force on the cell wall during 
plasmolysis, there must be mechanical coupling between these struc- 
tures. Deletion of pal—a gene that encodes a component of the Tol-Pal 
complex, which bridges the outer membrane and the cell wall—resulted 
in cell walls that underwent extreme contraction upon plasmolysis 
(Fig. 2d) and no contraction upon lysis (Fig. 2e), implicating this 
complex functions as a mechanical linker. Braun's lipoprotein (Lpp) 
is a highly abundant protein that also binds both the cell wall and the 
outer membrane. Mutant cells lacking /pp contracted more upon both 
plasmolysis and lysis, yielding a very large total contraction (Fig. 2d-f). 
Because the estimate for the ratio between the stiffness of the cell wall 
and the outer membrane did not change (Fig. 2g), but the total con- 
traction increased markedly (Fig. 2f), we speculate that deletion of [pp 
affects the coupling between the cell wall and the outer membrane as 
well as the stiffness of the cell wall and/or the outer membrane directly. 

To confirm that our measurements of envelope stiffness were not 
affected by variation in initial turgor pressure between conditions or 
strains, we subjected E. coli cells to small oscillatory hyperosmotic 
shocks (Fig. 2h, Supplementary Video 3), which caused cells and their 
envelopes to oscillate in size as they grew (Fig. 2i) but caused negligible 
plasmolysis’®. We hypothesized that the amplitude of cell-envelope 
oscillations should depend on envelope stiffness and should therefore 
be affected by perturbations to the outer membrane. In the absence 
of outer-membrane perturbation, cell-wall length oscillated with an 
amplitude of 2.2 + 0.2%. Treating cells with EDTA during the oscil- 
lations starkly increased the amplitude of the oscillations (Fig. 23, j, 
Extended Data Fig. 5f-k), as did FM 4-64 (Fig. 2j). As predicted, muta- 
tions that decreased outer-membrane stiffness (imp4213 and AompA) 
increased the oscillation amplitude (Fig. 2j, Extended Data Fig. 51), 
whereas the O8 antigen decreased the amplitude (Fig. 2j). Deletion 
of Ipp increased the oscillation amplitude, whereas deletion of pal did 
not (Fig. 2j). These data support our model that although Lpp weakens 
the outer membrane and/or the cell wall as well as their coupling, the 
Tol-Pal complex is responsible only for mechanical coupling, which is 
less important during small shocks that do not plasmolyse the cells!°. 

Each of the measurements described above tested the mechanical 
properties of the outer membrane under compression. E. coli cells 
do not swell under hypoosmotic shock", limiting our ability to test 
whether the outer membrane can bear tensile loads (although a very 
stiff outer membrane could account for this observation). To deter- 
mine whether the outer membrane affects the mechanical proper- 
ties of the cell envelope under other types of load, we used atomic 
force microscopy (AFM) to directly measure the stiffness of turgid 
cells, which is dependent on envelope stiffness’. Progressive addition 
of EDTA or FM 4-64 decreased cell stiffness in a concentration- 
dependent manner (Fig. 3a, b, Extended Data Fig. 6a, b). Mutations 
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Fig. 2 | Cellular mechanical properties are dependent on the 
composition and integrity of the outer membrane. a, E. coli cells (turgid, 
plasmolysed and lysozyme-treated) stained as in Fig. 1b (one experiment, 
similar results for two other wild-type strains, Extended Data Fig. 3g). 

b, Mean ratios between the rest length of the outer membrane, Jo, and 

the length of the turgid cell, 1}, and between /, and the length of the cell 
envelope during plasmolysis, /,. Data are mean + s.d. from 12 cells. P value 
from a paired two-sided Student's t-test. c, Length of representative cell 
walls versus time during hyperosmotic shock and treatment with EDTA 
(n= 184 cells total). d—f, Cell-wall length contractions upon plasmolysis 
(d), lysis (e) and in total (f) under chemical and genetic perturbations to 
the outer membrane (ratio with respect to the wild type (WT); n =79, 309, 
65, 70, 65, 55, 59 and 50 cells for the wild-type, EDTA, FM 4-64, imp4213, 
AompA, +08, Alpp and Apal groups, respectively). P values are from a 
Student’s two-sided t-test of the difference from the untreated wild-type 
control. NS, not significant. ¢ (in e), the ratio for cells expressing the O8 


that weakened the outer membrane during osmotic shock-based 
assays also decreased cell stiffness (Fig. 3c, Extended Data Fig. 6c). 
However, the O8 antigen did not increase cell stiffness (Fig. 3c), 
despite the strong phenotype we observed in osmotic shock-based 
assays (Fig. 2d-g, j). We speculate that because the O8 antigen greatly 
increases the length of LPS, it results in a thick layer that ‘pads’ the 
cell against perpendicular indentation forces but can nonetheless 
bear in-plane loads. Deletion of [pp markedly reduced cell stiffness 
(Fig. 3c). Deletion of pal rendered the cells extremely sensitive to 
indentation such that they lysed when addressed with the AFM can- 
tilever, precluding stiffness determination. 

Next, we used a microfluidics-based assay”! to measure the defor- 
mation of wild-type cells in response to bending forces before and after 
treatment with 10mM EDTA. A perpendicular fluid force was applied 
to filamentous cells (Fig. 3d), and the bending rigidity was calculated 
by fitting the deflection of the cell (Fig. 3e) to a mechanical model!. 
Untreated cells deflected less than did EDTA-treated cells, yielding 
bending rigidities of 5.8 x 10~*°40.4N m? and 2.4 x 107° £0.2N m’, 


Time (min) 


antigen was very large and not well-defined because the contraction 
upon lysis for the parental wild-type strain (AB1133) was approximately 
equal to zero, and adding the antigen markedly increased contraction 
(Extended Data Fig. 3h). g, Outer-membrane stiffness under chemical 
or genetic perturbations (ratio with respect to the wild type). Data are 
mean +s.d. Uncertainty propagated from ¢] and ¢, measurements. 

h, Sorbitol concentration in growth medium during 100-mM oscillatory 
osmotic shocks with a 2-min period. i, Representative cell-wall lengths 
during shocks in h (blue traces; n = 243 cells). A, amplitude of cell- 

wall length oscillations. Green shading indicates the period in which 
EDTA was included. The red curve represents the effective population- 
averaged length. j, Mean amplitude of cell-wall length oscillations during 
100-mM oscillatory shock with a 2-min period (n= 10 cycles for each 
measurement). Data are mean + s.d. P values are from a Student’s 
two-sided t-test, of the difference from the untreated wild-type control. 


respectively (Fig. 3f), again demonstrating the contribution of LPS to 
outer-membrane stiffness against various types of load. 

Given the profound contribution of the outer membrane to the 
mechanical properties of the cell envelope, we next explored their 
contribution to cell physiology. Wild-type E. coli cells gradually lysed 
during moderately large (400 mM) oscillatory osmotic shocks (Fig. 4a, 
Supplementary Video 4). The chemical perturbations N-lauroy]l sar- 
cosine, EDTA and FM 4-64 markedly decreased cell survival in the 
presence of shocks (Fig. 4a, b, Extended Data Fig. 7a, Supplementary 
Video 4). In the absence of shocks, EDTA and FM 4-64 caused 
little and no lysis, respectively (Fig. 4a), demonstrating that outer- 
membrane strength is important for cell survival during mechanical 
perturbations but not during steady-state growth. Genetic mutations 
that weakened the outer membrane or disrupted coupling between 
the outer membrane and the cell wall also markedly decreased cell 
survival during shocks (Fig. 4b, Extended Data Fig. 7b, d), whereas the 
presence of the O8 antigen delayed the onset of lysis (Fig. 4b, Extended 
Data Fig. 7c). 
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Fig. 3 | The stiffness of turgid cells is dependent on outer-membrane 
integrity. a, AFM measurements of cell stiffness versus time. One cell was 
treated with increasing EDTA concentrations (similar data for four treated 
and two untreated cells). b, Mean cell stiffness versus EDTA concentration 
(n=4 cells for each measurement). Data are mean +s.d. P values are from 
a Student’s two-sided t-test of the difference from the untreated wild-type 
control. c, AFM measurements of cell stiffness under chemical or genetic 
perturbations (relative to the untreated wild type, n =7, 4, 3, 13, 8, 7 and 

7 cells for the wild-type, EDTA, FM 4-64, imp4213, AompA, +08, Alpp 
and Apal groups, respectively). ND, Apal cells lysed under AFM. Data 

are mean +s.d. P values from a Student’s two-sided t-test of the difference 
from the untreated wild-type control. d, Microfluidic cell-bending assay. 

e, Displacement versus axial length for untreated (blue dots, n = 300) and 
EDTA-treated (pink dots, n = 367) cells. Solid and dashed black lines, best 
fits of the mechanical model used to calculate the bending rigidities. 

f, Mean bending rigidities of untreated and EDTA-treated cells. Error bars, 
95% confidence intervals. P value calculated from the confidence intervals. 


Finally, we hypothesized that the strength of the outer membrane 
would affect the survival of L-forms, viable wall-less cells that are pro- 
duced by inhibiting cell-wall biosynthesis**. All chemical and genetic 
perturbations that reduced outer-membrane stiffness also markedly 
reduced the number of viable L-forms generated after inhibiting wall 
synthesis (Fig. 4c, Extended Data Fig. 7e). During L-form generation 
in imp4213 cells, which yielded no viable L-forms (Fig. 4c), cells always 
lysed as soon as the cytoplasm blebbed out of the cell wall, which is 
precisely when turgor pressure is shifted entirely to the outer membrane 
(Supplementary Video 5), consistent with previous measurements 
showing that outer membrane integrity affect the dynamics of blebbing 
and lysis during antibiotic treatment*’. In summary, the mechanical 
properties of the outer membrane are critical for cell survival during 
osmotic fluctuation and L-form proliferation. 

Our conclusions revise our understanding of the physiology of both 
the cell wall and the outer membrane: whereas the former determines 
cell shape and the latter provides a selective chemical barrier, both are 
important mechanical elements in Gram-negative bacteria. Our finding 
that the outer membrane is under negligible or slightly compressive 
load during steady-state growth (Fig. 2b, Extended Data Fig. 3g) may 
provide a mechanism for the formation of outer-membrane vesicles, 
which could bud spontaneously under this circumstance. However, 
steady-state growth is unlikely to be common for E. coli: in the gut, bac- 
teria frequently encounter osmotic fluctuations and dynamic mechan- 
ical forces, which will constantly engage the load-bearing nature of the 
outer membrane. Cell-wall biosynthesis has long been an antibiotic tar- 
get because of the role of the cell wall in protecting the cell from osmotic 
lysis, and L-forms have been implicated in antibiotic persistence”. 
Hence, our finding that the outer membrane shares mechanical load 
with the cell wall, and is therefore critical for L-form proliferation, has 
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Fig. 4 | Undermining outer-membrane integrity reduces survival during 
mechanical perturbation and L-form proliferation. a, Cell lysis versus 
time under chemical perturbation to outer membrane. Shock, 400-mM 
oscillations with a 2-min period. No cells died in the absence of both 
oscillatory shock and outer membrane perturbation (n= 1,000 cells). tas, 
time at which 25% of cells had lysed. b, Time at which 25% of cells had 
lysed under chemical or genetic perturbations (ratio to the untreated wild 
type). c, Concentration of viable L-forms in overnight cultures under 
chemical or genetic perturbations to the outer membrane (ratio to the 
untreated wild type, one experiment). 


clear consequences for antibacterial therapy. Further research should 
elucidate the interdependence between the chemical and mechanical 
roles of the outer membrane. In particular, it is notable that LPS, which 
is a potent antigen that stimulates the mammalian immune system, 
thereby endangering pathogenic bacteria, also provides mechanical 
integrity to bacteria and therefore protects them. 
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Any Methods, including any statements of data availability and Nature Research report- 
ing summaries, along with any additional references and Source Data files, are available 
in the online version of the paper at https://doi.org/10.1038/s41586-018-0344-3. 
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METHODS 


The experiments were not randomized. The investigators were not blinded to allo- 
cation during experiments and outcome assessment. 

Bacterial strains and culture. Bacterial strains used in this study are listed in 
Supplementary Table 1. Three wild-type E. coli genetic backgrounds were used: 
MG1655, MC4100 and AB1133. We tested the effect of the imp4213 allele in the 
MG1655 and MC4100 backgrounds. The AompA, Apal, and Alpp deletions were 
tested in the MC4100 background. The O8 antigen was expressed in the AB1133 
background (ATM378)”. Strain KC427 (MG1655 imp4213 carB::Tn10) was con- 
structed using P1 transduction to transfer the imp4213 mutation from MC4100 
to MG1655 cells using NR693 lysate!®. Bacteria (including non-E. coli species) 
were grown in lysogeny broth (LB), Lennox formulation (5 g1~! NaCl) or minimal 
medium M9 with 20mM glucose, overnight in a rotary shaker at 37 °C. 
Statistical significance. Under the hypothesis that the outer membrane can bear 
large mechanical loads, we expected that chemical or genetic perturbations to 
the outer membrane would result in large changes in the deformation of the cell 
envelope under a given load. We therefore conservatively calculated the number 
of cells required to distinguish such a change using a two-sided Student's t-test to 
be n = 44 to obtain a power of 3=0.9 with statistical significance P=0.05 and 
effect size with Cohen’s d= 0.5. 

Imaging in microfluidic devices. Cells were imaged on a Nikon Eclipse Ti-E 
inverted fluorescence microscope with a 100x (NA 1.40) oil-immersion objective. 
Images were collected on a DU885 electron-multiplying charge-coupled device 
camera (Andor) using |.Manager v.1.4”°. Cells were maintained at 37°C during 
imaging with an active-control environmental chamber (HaisonTech). 

Overnight cultures were diluted 100-fold into 1 ml of fresh LB and incubated 
for 2h with shaking at 37°C, except in experiments in minimal medium, for 
which overnight cultures were diluted into M9 with 20 mM glucose and incu- 
bated for 8h. Cells were imaged in B04A microfluidic perfusion plates (CellASIC) 
and medium was exchanged using the ONIX microfluidic platform (CellASIC). 
Plates were loaded with medium pre-warmed to 37°C. Cells were loaded into the 
plate, which was incubated at 37°C, without shaking, for 30 min before imag- 
ing. The cell envelope was stained with wheat germ agglutinin-AlexaFluor488 
(WGA-AF488, Life Technologies), the fluorescent D-amino acid HADA (gift from 
the Brun and Vannieuwenzhe Laboratories, University of Indiana), and/or FM 4-64 
(Thermo Fisher). For staining of the cell wall, WGA-AF488 or HADA was added 
to the loading well to a final concentration of 10j1g ml”! or 1 mM, respectively, 
before loading cells into the imaging chamber. For staining of the outer membrane, 
FM 4-64 was added to the loading and perfusion medium to a final concentration 
of 2\.g ml! and medium containing FM 4-64 was perfused for 3h before imaging 
to saturate the polydimethylsiloxane of the microfluidic chamber with the dye. 
The osmolarity of the growth medium or phosphate-buffered saline (PBS) was 
modulated with sorbitol (Sigma). 

For plasmolysis and lysis and oscillatory osmotic-shock experiments to meas- 
ure the effect of chemical perturbations (EDTA and FM 4-64 labelling) on cell 
stiffness, MG1655-based strains were stained with WGA-AF488; non-MG1655 
strains stained poorly with WGA-AF488 (data not shown). For plasmolysis and 
lysis experiments, E. coli MC4100- and AB1133-based strains, Pseudomonas aerug- 
inosa, and V. cholerae were stained with HADA. For oscillatory osmotic shocks 
of non-MG1655 strains, HADA photobleached too quickly for tracking over 
many frames (data not shown); hence, we stained and tracked cell envelopes with 
FM 4-64. As FM 4-64 itself increased the amplitude of oscillations in MG1655 cells 
(Fig. 2j), ratios between the amplitudes of each FM 4-64-stained mutant and the 
FM 4-64-stained wild-type parental strain were reported (Fig. 2)). 

During plasmolysis and lysis experiments, cells were allowed to grow for 5 min 
in medium in the imaging chamber before being plasmolysed using medium with 
3M sorbitol and exposed to medium with 3 M sorbitol and 5% N-lauroyl sarcosine 
5 min later. For plasmolysis and lysis in PBS, cells were loaded into the imaging 
chamber in LB, which was exchanged for PBS with 70 mM sorbitol (to balance the 
osmolarity of PBS with LB) for 10 min, then exchanged with PBS and 3 M sorbitol 
for 5 min, and finally exchanged using PBS with 3 M sorbitol and 5% N-lauroyl 
sarcosine. For plasmolysis and lysis in the presence of EDTA, 10 mM EDTA was 
included in the plasmolysis and lysis media. 

To measure the rest length of the outer membrane, we plasmolysed cells 
stained with WGA-AF488 and FM 4-64, as described above. We then exchanged 
the plasmolysis medium with the same medium containing 1 mg ml! lysozyme 
(Thermo Fisher). After approximately 15 min, cells began to lose their rod-like 
shape and lyse. Upon lysis, when released from the constraint of the cell wall, the 
outer membrane of many cells collapsed into many small vesicles. However, the 
outer membranes of many other cells maintained their topology and their size 
could be measured (see below). 

When EDTA was applied to cells after plasmolysis (Fig. 2c), for reasons we do 
not understand, a larger fraction of cells quickly recovered from plasmolysis and 
swelled. Pre-treatment with FM 4-64 resulted in a similar de-plasmolysis (data 


not shown), indicating that chemical perturbation of outer membrane integrity is 
somehow responsible. Cells that de-plasmolysed were not included in population- 
averaged contraction measurements (Fig. 2d-f). 

For oscillatory osmotic shocks, cells were allowed to grow for 5 min in medium 
in the imaging chamber before being subjected to 100-mM oscillatory osmotic 
shocks by switching between LB and LB with 100 mM sorbitol. When adding 
EDTA during oscillatory shocks, two types of experiments were performed. For 
illustrative purposes, in one experiment cells were left untreated for ten cycles, 
and then treated with EDTA for ten cycles (Fig. 2h, i). In a complementary set of 
experiments, EDTA was added during the first ten cycles (but not before); this 
experiment was used to calculate the amplitude ratio for EDTA treatment (Fig. 2)). 

To measure lysis curves (Fig. 4a, Extended Data Fig. 7a-c), cells were loaded 
into the microfluidic chip as described above, and imaged in LB for 5 min before 
beginning chemical perturbation and/or oscillatory osmotic shock. A cell was 
considered to have lysed when it lost phase contrast and, in the case of oscillatory 
shocks, cell size ceased to oscillate. In the absence of shocks, most strains tested 
showed no lysis after 2h of growth in the microfluidic chamber; Apal and MG1655 
imp4213 cells showed a small degree of lysis (2.0% and 3.5%, respectively) after 2h 
of steady-state growth in LB. 

Cell-growth tracking and analysis. Custom MATLAB software was written to 
automatically track cells stained with WGA-AF488 or FM 4-64, as in previous 
studies!°. In plasmolysis and lysis experiments in which cells were stained with 
HADA, the signal-to-noise ratio was not high enough to track cell-wall contours 
automatically, so cell lengths (immediately before plasmolysis, immediately after 
plasmolysis and 20 min after detergent treatment) were measured manually in FIJI. 

To calculate outer-membrane rest lengths, we used a custom MATLAB algo- 
rithm to manually trace cell-envelope contours before plasmolysis, after plasmo- 
lysis and after lysozyme-induced lysis. In the first two states, cell length and width 
were automatically calculated. After lysozyme treatment, we calculated the length 
of the outer membrane as if it were rod-shaped with a width equal to the width of 
the cell before plasmolysis, as follows. We first calculated the surface area of the 
turgid cell envelope, Aturgia, from its contour by assuming rotational symmetry 
around its long axis. After lysis, the outer membrane adopted an amorphous mor- 
phology and remained trapped in the microfluidic chamber with height equal to 
the width of the cell before lysis, w. Therefore, for each cell we calculated the ‘rest 
surface area’ of the outer membrane, Aom, by first measuring the arc length of the 
cell contour, s, and the area enclosed within this contour, a, and then applied the 
equation Aom=2a + ws. Finally, to calculate the rest length of the outer membrane, 
lom, defined as the length of the outer membrane if it had surface area equal to 
Aom and was rod-shaped with a cell width equal to w, we used the equation lom = 
lurgid + (Aturgid — Aom)/(mw). This equation indicates that the difference between 
the rest length of the outer membrane and the length of the turgid cell is equal to 
the length of a cylindrical section with area (Aturgia — Aom) and radius w/2. 

To calculate the amplitude of length oscillations during oscillatory osmotic 
shocks, cells were tracked using custom MATLAB algorithms. First, cell-wall 
lengths (/) were automatically tracked (Extended Data Fig. 5f, g), and the elongation 
rate (¢ = d(In/)/dt) was calculated for each cell (Extended Data Fig. 5h). The effec- 
tive population-averaged length was calculated by integrating the population- 
averaged elongation rate over time!” (Extended Data Fig. 5i): 


ty 
lag = f edt +1 
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in which | is the mean initial cell length. The effective population-averaged length 
was then smoothed with a mean filter with window size equal to the period of oscill- 
ation (Extended Data Fig. 5i), and subtracted from the unsmoothed trace to obtain 
the deviation of the length oscillations around the smoothed trace (Extended Data 
Fig. 5j). The peak-to-peak amplitude was calculated for each cycle (Extended Data 
Fig. 5k). The mean amplitude was calculated by averaging the peak-to-peak ampli- 
tude over cycles. Uncertainty was estimated as the s.d. of the mean amplitude 
over cycles. 

The time scale of the loss of cytoplasmic contents (cytoplasmic GFP or ribosomal 
S2-YFP) and the time scale for cell-wall expansion during lysis were calculated 
using custom MATLAB algorithms. The integrated fluorescence intensity in each 
cell was computed over time during plasmolysis and lysis experiments. The dura- 
tion of efflux (or the time scale of expansion during lysis) was measured manually 
by selecting time points immediately before and after efflux (or lysis). 
Outer-membrane release assay. An overnight E. coli MG1655 culture was diluted 
100-fold into 100 ml fresh LB and incubated with shaking at 37°C for 2h. The 
culture was aliquoted into ten 15-ml tubes and centrifuged (Eppendorf 5804 R) 
at 3,000g for 10 min to pellet cells. The supernatant was decanted and each pellet 
was resuspended in 1 ml PBS and transferred to ten 1.5-ml tubes. The suspensions 
were centrifuged (Eppendorf 5415 D) at 15,000g to pellet cells. The supernatant 
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was removed and each pellet was resuspended in 10011 PBS. To these suspen- 
sions, 10011 of detergent or EDTA was added to the specified final concentrations 
and incubated at room temperature for 10 min. Cells and sacculi were pelleted by 
centrifuging (Beckman Coulter Optima Max-XP, Rotor TL-100) at 100,000 r.p.m. 
for 10 min at 4°C. The supernatant was removed for analysis. LPS was detected 
in serial dilutions of the supernatant using the Pierce Limulus Amebocyte Lysate 
(LAL) assay (ThermoFisher). After performing this reaction, absorbance at 410 nm 
was measured using an M200 96-well plate reader (Tecan). We found that detergent 
above 0.05% interfered with the LAL reaction (data not shown) and therefore we 
diluted supernatants from the detergent-treated samples 1,000-fold into 0.05% 
detergent before measuring LPS. 

Outer-membrane proteins in the supernatant were detected with immunob- 
lotting using an anti-OMP antiserum generously provided by the Thomas Silhavy 
Laboratory at Princeton University. Supernatant was added to 1x NuPAGE LDS 
sample buffer (Invitrogen). Proteins were separated by electrophoresis on a 4-12% 
Bis-Tris gel (Invitrogen) and transferred to a nitrocellulose membrane. After block- 
ing, membranes were probed with 1:30,000 anti-OMP antiserum and 1:10,000 
donkey anti-rabbit 800CW (LiCor Biosciences). Fluorescence antibody-bound 
proteins were detected with an Odyssey Imager (LiCor Biosciences). 

Analysis of peptidoglycan composition. Ultra performance liquid chromatogra- 
phy (UPLC) samples were prepared as previously described’. Overnight cultures of 
E. coli MG1655 were diluted 1:200 in 250 ml LB and grown at 37°C to an OD¢o0nm 
of 0.7. Cultures were centrifuged at 5,000g for 10 min at room temperature and the 
resulting pellet was suspended in 3 ml of either LB, LB with 3 M sorbitol, LB with 
3M sorbitol and 10mM EDTA or LB with 3 M sorbitol and 5% N-lauroyl sarcosine. 
All samples that contained 3 M sorbitol in the resuspension solution were allowed 
to sit at room temperature for 20 min after resuspension. Cell suspensions were 
then lysed by boiling in SDS for 3h. Lysed cell suspensions were ultracentrifuged 
at 400,000g to purify sacculi, which were digested with muramidase into muropep- 
tides. Samples were pH-adjusted and injected onto a Waters H Class UPLC system 
equipped with a BEH C18 1.7-1m column and eluted with sodium phosphate 
buffers. Peaks were quantified and identified as particular muropeptide species 
from their elution times”*, from which the crosslinking density and glycan-strand 
length were calculated as previously described’. 

Derivation of a mechanical model of the cell envelope. Cell-wall length contrac- 
tions upon plasmolysis (¢p), upon lysis (<)) and total contraction (€;) were defined 
as: Ep =(h, — Lh)/y x 100, ey = (Ly — lew) /lew X 100 and e¢= (hh — Lew)/Tew X 100, 
respectively, in which Jj is the length before plasmolysis, J, is the length after plas- 
molysis, and |.w is the length after lysis. We modelled the cell wall-outer membrane 
complex as two linear springs in parallel with spring constants key and kom; respec- 
tively, and rest lengths /-w and Jom, respectively (Fig. 1a). Although the mechanical 
properties of the cell envelope have been measured to be nonlinear by atomic force 
microscopy’, envelope contraction upon hyperosmotic shock was approximately 
linear with respect to shock magnitude up to the maximum contraction caused by 
large shocks (Extended Data Fig. 1), motivating our use of a linear spring-based 
model for estimating outer-membrane stiffness. Initially, the springs are subjected 
to an applied force, F, representing turgor pressure, and have initial length J}. When 
the force is released (when the cell is plasmolysed), the springs relax to length bh. 
The force balance equation is: 


Koy (I,—loy) on kom (1,—-lom) =0 (2) 
We substitute for J, using the definition of =, which yields: 


& 
kom = yt he 
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Using the experimental finding that lom =) =/.w(1 + €t) (Fig. 2b, Extended Data 

Fig. 3g) and substituting the identity ¢,=(1 + €p)(1 + €)) — 1 yields lom=ew(1 + €p) 
(1+ 4). Finally, substituting this expression into equation (3) yields equation (1) 
of the main text. 
AFM measurements of cell stiffness. The day before AFM, poly-L-lysine-coated 
glass slides were prepared by boiling 22-mm square cover glasses in 2% Micro-90 
detergent. Cover glasses were rinsed in ethanol and baked in a dry oven at 80°C for 
>1h. Cover glasses were glued onto microscope slides using super glue. 

On the day of measurement, 101] of 0.01% poly-L-lysine were spotted onto 
the centre of the cover glass and dried for 1h. A silicone well (Grace Biolabs) was 
affixed to the cover glass around the poly-L-lysine spot. An overnight culture of 
E. coli was diluted 100-fold into LB and incubated at 37°C for 1.5h. One millilitre 
of culture was spun down in a table-top microcentrifuge for 1 min at 15,000g and 
resuspended in 1 ml de-ionized water. This suspension was then spun down and 
concentrated in 1001] de-ionized water. This suspension was added to the well 
on the poly-L-lysine-coated slide. The slide was incubated at room temperature 
for 15 min. The well was rinsed with de-ionized water three times to remove cells 


LETTER 


that were not stuck to the surface. The well was then immersed in 100,11 M9 with 
20mM glucose. 

AFM experiments were performed on a Bioscope Resolve (Bruker) with MLCT- 
BIO-E (Bruker) cantilevers with a nominal spring constant of 0.1 N m7! (for EDTA 
experiments) or PFQNM-LC (Bruker) cantilevers (0.08 N m~!). Cantilevers were 
calibrated using the thermal noise method*°. We measured the deflection sensitiv- 
ity (nm V~') from the slope of a force curve recorded on a glass surface. The res- 
onance frequency and q-factor of the cantilever were determined by a Lorentzian 
fit of the power spectral density of the cantilever recorded using the thermal tune 
option in the Nanoscope Analysis v.1.8 (Bruker) software. From these parameters, 
the spring constant was calculated. 

A polydimethylsiloxane calibration sample with known stiffness of 3.5 MPa was 
measured using force volume imaging mode in an area of 1 x 1pm ata resolution of 
32 x 32 pixels at 0.5-Hz ramp rate and 600-nm ramp size (Extended Data Fig. 6d). 
Force curves were recorded to a maximal deflection error of 5nm, and analysis of 
the force curves was accomplished using Nanoscope Analysis v.1.8. For analysis 
of the polydimethylsiloxane stiffness measured with PFQNM-LC cantilevers, the 
baseline correction of the force curve was performed using 10-80% of the ramp 
size, with a tip radius set to 65 nm, the tip half angle to 18°, and Poisson's ratio of 
0.3. The approach direction of the force curve was used to analyse the stiffness, 
assuming a Hertz contact model as a modulus fit model for the interaction of a 
sphere with a surface, resulting in a mean stiffness of 3.41 + 0.73 MPa (Extended 
Data Fig. 6d). These parameters were then used to analyse force curves recorded 
on immobilized E. coli with a PFQNM-LC cantilever. For measurements with 
MLCT-BIO-E cantilevers, the tip radius was set to 20nm. 

For each cell measured, an image of the immobilized cell was recorded to con- 
firm firm attachment and to position the cantilever at the centre of the cell before 
recording force curves. Three consecutive force curves with z-range of 800 nm were 
recorded at a rate of 0.5 Hz to a maximum deflection of 5 nm. Force curves were 
further analysed using NanoScope Analysis v.1.8. For each cell, the mean stiffness 
was calculated from the three force curves. In all cases, individual slides or dishes 
were measured for no more than 1h. 

To compare our reported cell stiffness values (akin to Young’s modulus of the cell 
as if it was a single bulk material, previously shown to be dependent on envelope 
stiffness*) to previous measurements’, we used the PDMS calibration to compute 
the relationship between stiffness values in MPa and in N m7! using the Nanoscope 
analysis software. The previously reported cell stiffness value® (0.017 N m_') con- 
verts to a Young’s modulus value of 0.5 MPa for our experimental methodology, 
which compares very favourably with our data in Fig. 3b. 

For measuring the effect of FM 4-64 on cell stiffness, single force curves were 
taken every minute. After several minutes, a concentrated solution (41g ml!) of 
EM 4-64 was added to a final concentration of 2j.g ml“!. 

For measuring the effect of EDTA on cell stiffness, instead of supported cover- 

glass slides, we adhered cells to glass-bottom dishes (Ted Pella, 14026) that had 
been spotted with poly-L-lysine as described above. After cells adhered, dishes 
were immersed in 4ml M9 with glucose medium. For each cell, single force curves 
were recorded at 1-min intervals. After 5-10 min, the EDTA concentration was 
increased stepwise every 10 min to final concentrations of 5mM, 10 mM, and 
20 mM by adding 1 ml of appropriately concentrated stock EDTA solution. 
Microfluidic assay of bending stiffness. The bending rigidity of E. coli cells was 
measured using a microfluidic assay essentially as described*®”". E. coli MG1655 
was transformed with a plasmid (pDB192) containing sulA under an isopropyl 
8-p-1-thiogalactopyranoside (IPTG)-inducible promoter. Cells were grown over- 
night in 2 ml LB containing 30,.g ml“! kanamycin and 501g ml“! ampicillin. 
IPTG was added to the medium to induce sulA throughout cell growth in the 
microfluidic flow chamber. Deflection of cells under fluid flow was monitored on 
a Zeiss Axiovert 100 microscope (Zeiss) equipped with a 60 x oil objective. Images 
were collected with an Andor iXon 3 EMCCD (Andor) using .Manager v.1.4”°. 
Deflection of the cells was determined using a custom Igor Pro (WaveMetrics) 
image-analysis algorithm. 
Quantification of L-form viability, physiology and growth. Overnight cultures of 
the appropriate strains were diluted 1:100 into LFLB (LB supplemented with 3.6% 
sucrose and 10 mM MgSO,). Cultures were incubated at 37°C for 1h, at which 
point cefsulodin was added to a final concentration of 60,.g ml~!. As appropriate, 
chemical treatments (EDTA and FM 4-64) were also added at this point. Cells were 
incubated for 12h with shaking at 30°C. Five microlitres of serial tenfold dilutions 
were plated on LFLB with cefsulodin and LB with cefsulodin plates. Plates were 
incubated at 30°C for 24h. Plates were imaged with a Canon EOS Rebel T5i DSLR 
camera and colony-forming units per ml were counted manually. MC4100-based 
L-forms formed smaller colonies that were not visible with the DSLR camera, 
and thus spots from the plates were excised and imaged with an inverted micro- 
scope (Nikon Eclipse Ti-E) at 10 magnification. For each L-form culture except 
AB1133, spotting onto LB with cefsulodin plates reduced yield by at least an order 
of magnitude relative to LFLB with cefsulodin (data not shown). 
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L-forms produce LPS at levels approximately equal to untreated cells (6.6 x 10°° 
and 7.2 x 10~° endotoxin units per cell for untreated cells and L-forms, respec- 
tively). To measure this, 1 ml of overnight culture was washed twice in PBS. 
N-lauroyl sarcosine was added to a final concentration of 5%, and cell suspensions 
were incubated at 90°C for 10 min to lyse cells and dissolve their membranes. LPS 
was detected in tenfold serial dilutions of this solution using the Pierce Limulus 
Amebocyte Lysate assay (Thermo Fisher). 

To image L-form generation from imp4213 cells, a 5-ml culture was grown 
overnight in LB at 30°C. The overnight culture was diluted 100-fold into LB and 
incubated at 30°C for 2h. Five microlitres of this culture were spotted onto an LFLB 
with cefsulodin agarose pad and imaged every minute at 30°C. 

Code availability. All custom code used for the current study is available from the 
corresponding author on reasonable request. 

Reporting summary. Further information on experimental design is available in 
the Nature Research Reporting Summary linked to this paper. 


Data availability. The datasets generated during the current study are available 
from the corresponding author on reasonable request. 
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Extended Data Fig. 1 | Cell-wall deformation is approximately linear 


with respect to hyperosmotic shock over a large range. Population- 
averaged contraction of cell-wall lengths versus hyperosmotic shock 


magnitude (n= 92, 73, 53, 71, 58, 11, 31, 47 cells). Data are mean 4 


t s.d. 


The dotted line is the linear best fit for experimental data for shocks 
with magnitude < 800 mM. The plateau after 800 mM demonstrates 
that the cell envelope has reached its minimum length upon large, 3M 
hyperosmotic shocks. 
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Extended Data Fig. 2 | See next page for caption. 
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Extended Data Fig. 2 | Detergent or EDTA treatment causes dissolution 
of the outer membrane. a, Description of the outer-membrane release 
assay used to measure the effect of detergent and EDTA on the outer 
membrane. See Methods section ‘Outer membrane release assay’ for a 
more detailed description. b, Absorbance of 410-nm light (left axis) in 
the supernatant from detergent-treated cell suspensions after performing 
the LAL LPS detection assay. Absorbance is correlated with the amount 
of LPS in the sample, and is linear below 1,000 EU ml"! (right axis). This 
experiment was performed once. c, Immunoblot with an antibody cross- 
reactive with several outer-membrane proteins (for example, LamB and 
OmpA) of the supernatant of cells treated with various concentrations of 
N-lauroyl sarcosine (detergent). This experiment was performed once. 

d, Absorbance of 410-nm light (left axis) in the supernatant from EDTA- 
treated cell suspensions after performing the LAL LPS detection assay. 
Absorbance is correlated with the amount of LPS in the sample, and is 
linear below 1,000 EU ml! (right axis). This experiment was performed 
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once. e, Montage of a representative plasmolysed cell expressing cytosolic 
GFP lysing upon detergent treatment. Cell expansion corresponds to 

the time of GFP disappearance. Data are mean + s.d. from 32 cells in 

one experiment. f, Duration of swelling (left), of release of cytosolic 

GFP (centre), and of release of ribosomes upon lysis of plasmolysed 

cells (n = 22, 32, 12 cells, respectively; one experiment each). Error bars, 
+1 s.d. The duration of swelling is approximately equal to the duration 
of the release of cytoplasmic contents. g, Ultra performance liquid 
chromatography of peptidoglycan composition of cell walls purified from 
untreated (red), sorbitol-treated (yellow), detergent-treated (green), and 
EDTA-treated (blue) cells. Mean glycan strain lengths (i) and percentage 
of various peptidoglycan subunit eluents (ii) are the same across all 
treatments. Dap-Dap, Dimers formed with double diaminopimelic acid 
bonds as opposed to diaminopimelic acid—p-alanine bonds; lipoprotein, 
subunits bound to Lpp. n=4, 2, 2 and 1 sacculi preparations for untreated, 
sorbitol-treated, detergent-treated and EDTA-treated cells, respectively. 
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Extended Data Fig. 3 | See next page for caption. 
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Extended Data Fig. 3 | Detergent treatment causes contraction of the 
cell wall across a range of conditions and organisms. a, Length of the cell 
wall versus time during hyperosmotic shock (3 M sorbitol, solid arrow) 
and subsequent treatment with detergent (5% N-lauroyl sarcosine, dotted 
arrow), and washout of the detergent (switching back to 3 M sorbitol, 
dashed arrow) for four representative cells (n = 260 cells). The cells briefly 
swelled upon washout, then relaxed to the pre-washout rest length. We 
speculate that the brief swelling is caused by a hypoosmotic shock during 
washout (lowering the concentration of detergent). That is, we propose 
that the detergent has a slow diffusion constant through the cell wall, and 
therefore washing it out imposes a transient hypoosmotic shock, before 
detergent within the cell diffuses out of it. When it does, the cell wall 
relaxes again to its rest state, demonstrating that cell-wall contraction 
upon detergent treatment is not due to compaction by the detergent. 

b, Lengths of the cell wall versus time during hyperosmotic shock (3 M 
sorbitol, solid arrow) and subsequent addition of detergent (5% N-lauroyl 
sarcosine, dotted arrow) for four representative B. subtilis cell chains 
(n=112 cell chains). Rather than contract upon detergent treatment, 

the cell walls re-extend. We hypothesize that the cell wall is in a highly 
compressed state after 3-M hyperosmotic shock because the cytoplasm is 
exerting an inward force via connections between the plasma membrane 
and cell wall. When the plasma membrane is dissolved with detergent, this 
force is relieved and the cell wall extends to its rest length. c, Lengths of 
the cell wall of B. subtilis cell chains versus time during a 1-M oscillatory 
osmotic shock, which caused cell lysis (for example, red arrows), and 
treatment with detergent (5% N-lauroyl sarcosine, dotted arrow; n= 93 
cell chains). The cell walls expanded only slightly upon detergent addition, 


LETTER 


indicating that it is lysis rather than detergent treatment that causes the 
re-extension in (b). d, Contraction upon plasmolysis, subsequent lysis and 
total contraction for three environmental conditions: i) LB (n=79, 56 and 
56 cells, respectively), ii) M9 (n= 46, 52 and 38 cells, respectively) and 

iii) cultured in LB and transferred to PBS before plasmolysis (n = 95, 86 
and 94 cells, respectively). Error bars indicate +1 s.d. e, Contraction upon 
plasmolysis, subsequent lysis and total contraction for three wild-type E. 
coli strains: i) MG1655 (n=79, 56 and 56 cells, respectively), ii) MC4100 
(n= 48, 48 and 48, respectively) and iii) AB1133 (n =56, 56 and 56 cells, 
respectively). Error bars indicate +1 s.d. f, Contraction upon plasmolysis, 
subsequent lysis, and total contraction for i) P. aeruginosa (n= 12, 12 

and 12 cells, respectively) and ii) V. cholerae (n = 36, 36 and 36 cells, 
respectively). Error bars indicate +1 s.d. g, The mean ratio between the 
rest length of the outer membrane, Jom, and the length of the cell envelope 
in the fully turgid state, ), for three wild-type E. coli strains (n= 12, 10 and 
10 cells for MG155, MC4100 and AB1133, respectively). The rest length 
of the outer membrane is approximately equal to the length of the fully 
turgid cell for all three strains. Error bars indicate +1 s.d. The P value was 
calculated using a Student’s two-sided t-test. h, Mean contraction upon 
plasmolysis, subsequent lysis, and total contraction for wild-type AB1133 
(n= 56 cells) and an isogenic strain expressing the O8 antigen (n=55 
cells). The ratio for cells expressing the O8 antigen is very large because 
the contraction upon lysis for the parental wild-type strain (AB1133) was 
close to zero, and adding the antigen markedly increased contraction. 
Error bars indicate +1 s.d. P values were calculated using a Student's two- 
sided t-test. 
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Extended Data Fig. 4 | Impermeable molecules within the cell wall 
after lysis cause residual turgor pressure. a, Increasing detergent 
concentration caused an approximately proportional increase in the 
mean cell-wall contraction upon lysis and the mean total contraction in 
E. coli MG1655 cells. Each point represents one experiment. The number 
of cells for each experiment is given in Supplementary Table 2. 

b, Representative DAPI-stained E. coli cells before (left) and after (right) 
lysis shown in phase contrast and epifluorescence, demonstrating that 
DNA is not retained in the lysed cells. The experiment was performed 
once. c, Representative E. coli cells expressing a fluorescent protein fusion 
to the S2 ribosomal protein before (left) and after (right) lysis shown 

in phase contrast and epifluorescence, demonstrating that ribosomes 

are not retained in lysed cells. The experiment was performed once. 

d, Representative E. coli cells expressing cytosolic GFP before (left) 


Cytoplasmic phase intensity post-lysis (A.U.) 


and after (right) lysis shown in phase contrast and epifluorescence, 
demonstrating that GFP is not retained in lysed cells. The experiment was 
performed once. e, Representative E. coli cells expressing a fluorescently 
tagged version of MreB after lysis shown in phase contrast (left), 
epifluorescence (centre) and in overlay (right), demonstrating that MreB 
is retained within most lysed cells, but that cells with weak phase density 
after lysis retain low levels of MreB (arrows). f, Cumulative fluorescence 
intensity of MreB-sfGFP versus the average phase-contrast intensity 
within the cell after lysis (n = 162 cells). Cells with higher phase density 
have lower intensity. g, Cell-wall contraction upon lysis versus average 
phase-contrast intensity within the cell after lysis (n = 46 cells). h, Total 
contraction during plasmolysis and lysis versus average phase-contrast 
intensity within the cell after lysis (n = 46 cells). 
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Extended Data Fig. 5 | EDTA weakens the E. coli cell envelope. a, 
Length of the cell wall versus time for 7 representative B. subtilis cell 
chains during treatment with detergent with treatment using detergent 
and 10mM EDTA (n=68 cell chains). Although detergent caused lysis, 
subsequent addition of EDTA did not affect cell-wall rest length. b, 
Lengths of the cell wall of B. subtilis cell chains versus time during 1-M 
oscillatory osmotic shocks, which caused cell lysis (for example, red 
arrows), with treatment using 10 mM EDTA (dashed arrow; n = 127 

cell chains). EDTA did not affect the rest length of the cell walls. c, Top, 
length of the B. subtilis cell wall versus time through a hyperosmotic shock 
(3M sorbitol, solid arrow) and subsequent treatment with 10 mM EDTA 
(dotted arrow) for four representative cell chains (n = 61 cell chains). Cell- 
wall length did not decrease after detergent, as it did for E. coli. Bottom, 
micrograph of a two-cell chain expressing cytosolic (strain HA405, left) 
and a kymograph showing fluorescence intensity along the dotted red line 
during the experiment in the top graph. The cell chain in the kymograph 
corresponds to the bottom-most length trace in the top graph. Red arrows 
demonstrate that the discrete increases in length observed after EDTA 
treatment correspond to cell-lysis events, when fluorescence within the 
cells begin to decrease. d, Population-averaged E. coli cell-wall length 
contraction upon EDTA application after plasmolysis increased with 
increasing concentration of EDTA (n= 131, 193, 225, 138, 81, 94, 36, 28, 
72 cells, respectively). Error bars indicate +1 s.d. e, Length of the cell 

wall versus time during hyperosmotic shock (3 M sorbitol, solid arrow) 
and subsequent treatment with 10 mM EDTA and 50 mM MgCl, (dotted 
arrow) for representative E. coli cells (n=91 cells). f, Length of the cell 


untreated WT stiffness 


walls of representative E. coli cells during 100-mM oscillatory shocks with 
2-min period (n= 243 cells). g, Length of the cell walls of representative 
E. coli cells during a 100-mM oscillatory shock with 2-min period and 
10mM EDTA (n = 284 cells). h, Population-averaged elongation rate of 
the E. coli cell wall during 100-mM oscillatory shocks with 2-min period 
for untreated (black line) and 10 mM EDTA-treated cells (n = 284 cells). 

i, Effective population-averaged cell length (Jef), calculated by integrating 
the population averaged elongation rate in h during 100-mM oscillatory 
shocks with 2-min period for untreated (black line) and 10 mM EDTA- 
treated cells (n = 284 cells). Dotted lines are the respective time-averaged 
lo using a rolling-window averaging filter with a 2-min window (equal 

to the period of oscillations). j, Deviation of the effective population- 
averaged length in i from the respective time-averaged trace. k, The 

mean amplitude of oscillation was found by averaging the peak-to-peak 
amplitude in j over cycles (n = 10 cycles). Error bars indicate +1 s.d. The 
P value was calculated using a Student’s two-sided t-test. 1, Amplitude 

of cell-wall length oscillations (ratio with respect to the untreated wild 
type; Fig. 2j) versus cell-wall stiffness calculated from plasmolysis-lysis 
experiments (ratio with respect to the untreated wild type; Fig. 2g). 

Solid line, linear best fit for only perturbations to the outer membrane 
(red circles; linear regression: R?=0.71, F=9.7, P= 0.0356). Dashed 

line, best fit when additionally considering perturbations to protein 
linkages between the outer membrane and cell wall (dashed circles; linear 
regression: R? = 0.25, F= 1.4, not significantly different from horizontal). 
For the O8-expressing strain, we conservatively used a stiffness ratio of 1.5 
for the fits. 
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curves taken during the period in which the cell was treated with the stiffness) and Fig. 2d-f (envelope stiffness). For the O8-expressing strain, 
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the cell was treated with each EDTA concentration. Average-force curves Young’s modulus of 3.5 MPa. Top, distribution of measurements of Young’s 
were registered with respect to the onset of force increase as the cantilever modulus across the calibration sample. Red curve is the Gaussian fit to the 
was lowered. b, Stiffness of a representative cell versus time, as measured data. Dashed line is the mean. Top inset, Young’s modulus measurements 
with AFM. At t= 10 min the cell was treated with 2 1g ml“! FM 4-64 were spatially uniform. Bottom, box plot of the distribution of Young’s 
(n=3 cells). c, The ratio of the cell stiffness computed with AFM (Fig. 3c) modulus measurements showing the median stiffness (red line), 25% and 
versus the ratio of envelope stiffness computed from plasmolysis—lysis 75% percentiles (edges of box), extreme bounds (whiskers), and outliers 
experiments (Fig. 2g) across chemical and genetic perturbations. The (red points). 
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Extended Data Fig. 7 | Genetic perturbations to the outer membrane 
render cells vulnerable to mechanical perturbation. a, Cell lysis versus 
time and cycle number during application of 5% detergent (N-lauroyl 
sarcosine, n= 112 cells) and 5% detergent with 400-mM oscillatory 
osmotic shock (n= 100 cells). Cell-lysis rate increased notably during 400- 
mM oscillatory osmotic shocks. b, Cell lysis versus time and cycle number 
during 400-mM oscillatory osmotic shocks with 2-min period for mutant 
E. coli strains with the MC4100 wild-type background (n = 476, 98, 187, 
99, 280 cells for the wild-type, imp4213, AompA, Alpp and Apal groups, 
respectively). c, Cell lysis versus time and cycle number during 400-mM 
oscillatory osmotic shock with 2-min period for E. coli wild-type AB1133 


(n= 273 cells) and ATM378 (AB1133+08, n= 123 cells). d, Time at which 
25% of cells had lysed (ratio to the untreated wild type, Fig. 4b) versus the 
ratio of amplitudes during 100-mM oscillatory osmotic shocks (Fig. 2)). 
The line is the linear best fit (linear regression; R?=0.84, F=19.4, 
P=0.0045). e, Serial dilutions of overnight E. coli L-form cultures spotted 
onto solid media permissive for L-form growth. For chemical treatments, 
10mM EDTA and 21g ml“! FM 4-64 were included in the liquid media 
used to culture L-forms overnight, but not in the solid media onto which 
L-forms were spotted. Mutants (AompA, +08, Alpp, Apal) formed very 
small colonies, and were thus viewed using an inverted microscope to 
ensure accurate counting. 
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Single-cell mapping of the thymic stroma identifies 
IL-25-producing tuft epithelial cells 


Chamutal Bornstein*°, Shir Nevol®, Amir Giladi>®, Noam Kadouri!®, Marie Pouzolles’, Francois Gerbe’, Eyal David!', 
Alice Machado”, Anna Chuprin!, Beata Toth*, Ori Goldberg°, Shalev Itzkovitz*, Naomi Taylor?, Philippe Jay’, 


Valérie S. Zimmermann2, Jakub Abramson!”* & Ido Amit!”* 


T cell development and selection are coordinated in the thymus by a 
specialized niche of diverse stromal populations!-*. Although much 
progress has been made over the years in identifying the functions 
of the different cell types of the thymic stromal compartment, 
there is no comprehensive characterization of their diversity and 
heterogeneity. Here we combined massively parallel single-cell 
RNA-sequencing*®, spatial mapping, chromatin profiling and gene 
targeting to characterize de novo the entire stromal compartment 
of the mouse thymus. We identified dozens of cell states, with 
thymic epithelial cells (TECs) showing the highest degree of 
heterogeneity. Our analysis highlights four major medullary TEC 
(mTEC I-IV) populations, with distinct molecular functions, 
epigenetic landscapes and lineage regulators. Specifically, mTEC IV 
constitutes a new and highly divergent TEC lineage with molecular 
characteristics of the gut chemosensory epithelial tuft cells. Mice 
deficient in Pou2f3, a master regulator of tuft cells, have complete 
and specific depletion of mTEC IV cells, which results in increased 
levels of thymus-resident type-2 innate lymphoid cells. Overall, 
our study provides a comprehensive characterization of the thymic 
stroma and identifies a new tuft-like TEC population, which is 
critical for shaping the immune niche in the thymus. 

The thymus constitutes a specialized lymphoid organ, where immature 
T lymphocytes are educated to recognize foreign antigens, while toler- 
ating self. The T cell ‘educational program involves two central steps, 
which occur in two anatomical compartments of the thymus, the cortex 
and the medulla. Both compartments are characterized by the presence 
of specialized stromal cells, which provide the desired microenviron- 
ment for different checkpoints in T cell development and selection”. 
Cortical thymic epithelial cells (CTEC) coordinate the early stages of 
T cell development and positive selection of thymocytes®. The later 
steps of T cell development, including negative selection, are primarily 
carried out by mTECs!*. 

Although much progress has been made over the years in elucidat- 
ing the function of the different cell types of the thymic stroma, their 
diversity, heterogeneity and molecular pathways are still poorly charac- 
terized. To de novo characterize the entire stromal compartment of the 
thymus, we performed massively parallel single-cell RNA-sequencing 
(MARS-seq)*” of 1,825 non-haematopoietic cells (CD45~) isolated 
from adult mouse thymi (Extended Data Fig. 1). In order to link the 
canonical surface markers to the single-cell RNA-sequencing data, we 
used an index sorting strategy that allowed for retrospective analysis 
of surface markers of each individual cell. We then used the MetaCell 
pipeline to identify homogeneous and robust groups of cells (Methods). 
This analysis showed that the thymic stroma is composed of three major 
lineages, consisting of fibroblasts (Colla1 and Col6a1), endothelial cells 
(Pecam1 and Fit1) and epithelial cells (Epcam and various keratin genes; 
Fig. la-c and Extended Data Fig. 2). Of the three linages, the epithelial 
cells displayed the largest heterogeneity in gene-expression programs 


(Extended Data Fig. 2 and Supplementary Tables 1, 2), suggesting that 
they are more complex and heterogeneous than previously anticipated. 

In order to comprehensively characterize the TEC compartment, 
we sorted additional 1,716 CD45~-EpCAMt single cells (Extended 
Data Figs 1, 3). Clustering analysis combined with two-dimensional 
projection of the epithelial cells from both datasets revealed dozens 
of different TEC subpopulations (Fig. 1d, e, Extended Data Fig. 3 and 
Supplementary Table 2). The TEC subpopulations clustered within five 
major molecular types, each distributed at a distinct position within 
the two-dimensional projection (Fig. 1d and Extended Data Fig. 3). 
Index sorting analysis using the canonical cortical (Ly51) and medul- 
lary (UEA1) markers revealed that only one of the groups corresponds 
to the Ly51*UEA1~ population and expresses cTEC-specific genes, 
including Prss16, Psmb11 and C. tsl, By contrast, the other four TEC 
populations stained positively for UEA1 and had no or low expression 
of Ly51 (Extended Data Fig. 3), suggesting that these cells reside in the 
medulla. This was further validated by immunofluorescence staining 
and single-molecule RNA fluorescence in situ hybridization using a 
panel of markers specific to the individual TEC subpopulations (Fig. 1f 
and Extended Data Fig. 3). Therefore, on the basis of these data, we 
reclassified the mTEC compartment into four major groups (mTEC 
I-IV), reflecting their distinct transcriptional and molecular charac- 
teristics. Specifically, mTEC I is characterized by high expression of 
Itga6 and Sca1 (also known as Ly6a) (Fig. le and Extended Data Fig. 3), 
expression of which have previously been associated with putative TEC 
progenitors’. The mTEC II population is characterized by specific 
expression of the canonical markers of mature mTECs, including high 
expression of Aire, Fezf2, Cd40, H2-Aa or Cd74 (Fig. le and Extended 
Data Fig. 3). mTEC III represents a heterogeneous population express- 
ing several unique genes (Pigr, Ly6d, Spink5, Ivl and Krt10), some of 
which have been linked to a putative population of mTEC that previ- 
ously expressed AIRE (post-AIRE cells)®? (Fig. le and Extended Data 
Fig. 3). Notably, the mTEC IV population does not express any classical 
mTEC or cTEC markers, but rather a unique set of genes such as Lrmp, 
Avil, Trpm5, Dclk1, Gng13, Llcam and Sox9 (Fig. le and Extended Data 
Fig. 3). 

To investigate the dynamics of the different TEC populations during 
early development, we performed MARS-seq analysis of 3,074 sorted 
CD45" EpCAM*¢ single cells isolated from thymi at major develop- 
mental stages: embryonic day 14.5 (E14.5), E18.5 and day 6 after birth 
(Extended Data Figs. 1, 4). Because the developing thymus may have 
additional cell types or states that are not observed in the adult, we 
associated embryonic TEC metacells with adult phenotypes only if a 
large fraction of their cell neighbours was of adult origin (Methods, 
Fig. 1g, Extended Data Fig. 4 and Supplementary Table 3). This 
analysis highlighted the dynamic changes in the TEC compartment 
during thymus organogenesis. Although most of the E14.5 TECs were 
relatively homogenous and expressed a large number of cTEC-specific 
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Fig. 1 | The medulla epithelial compartment has diverse molecular 
functions. a, Clustering analysis of 1,825 thymic stromal (CD45) cells 
from seven biological replicates of 4-6-week-old mice. b, Expression 
of selected marker genes. c, Index sorting tracks showing protein level 
intensity. d, Two-dimensional graphical representation of 2,341 single 
CD45-EpCAM‘ cells separated into five TEC subsets. e, Kernel density 
projection of differentially expressed genes onto the two-dimensional 
graph. f, Immunofluorescence images of thymus sections. Medulla (M) 
and cortex (C) are separated by a dashed line, distinguished by nuclei 


genes, their general transcriptional signature was distinct from adult 
cTECs (Fig. 1g and Extended Data Fig. 4). Specifically, we observed 
progressive downregulation of cell cycle genes and upregulation of 
the MHC-I pathway in the adult compared to the embryonic cTECs 
(Fig. 1h, Extended Data Fig. 4 and Methods). While none of the adult 
mTEC subpopulations were present at the E14.5 stage, mTEC | and II, 
but not mTEC II] and IV, became detectable in the thymus at E18.5 
(Fig. 1g and Extended Data Fig. 4). At neonatal day 6, mTEC I, I and 
IV were present, although with different frequencies than in the adult 
thymus (Fig. 1g and Extended Data Fig. 4). Measuring the percentage of 
proliferating cells, we found that E14.5 TECs are the most proliferative 
cells in the thymus, while after birth most of the TEC proliferation is 
restricted to mTEC I and mTEC II cells (Extended Data Fig. 4). 

To further characterize the newly defined mTEC subtypes, we 
established a new sorting strategy, based on a panel of surface mark- 
ers unique to each population (Fig. 2a). This strategy was validated 
by MARS-seq and qPCR analyses of sorted EpCAM* populations, 
which were gated according to this scheme (Fig. 2b and Extended Data 
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density. Blue, DAPI. 85t is also known as PSMB11; CD49f is also known as 
ITGAG6. Scale bars, 40 um. Images are representative of three independent 
animals with similar results. g, Distribution of the TEC subsets along 

four developmental time points. Grey labels represent cells distinct from 
adult TECs. h, Progression of early cTECs towards adult cTECs. Circles 
represent cTEC metacells coloured by the developmental time points of 
the majority of cells. Axes represent share of each gene module from the 
entire metacell transcriptome. 


Fig. 5). Profiling the putative enhancer regulatory elements (marked by 
H3K4mez2) of the four mTEC populations using indexing-first chroma- 
tin immunoprecipitation followed by deep sequencing” revealed that 
each subset is characterized by a unique set of distal enhancer regions 
with the mTEC IV population showing the most distinct regulatory 
elements (Fig. 2c, d, Extended Data Fig. 5 and Extended Data Table 1). 
In order to determine specific transcription-factor-binding sites, we 
performed an assay of transposase-accessible chromatin and analysed 
peaks of open chromatin within enhancer regions", followed by de 
novo motif finding (Extended Data Fig. 5 and Extended Data Table 1). 
This analysis revealed that accessible enhancer regions in mTEC II cells 
are significantly enriched (P= 107 !°; binomial test) for a binding-motif 
signature of the NF-«B family, correlating with specific expression of 
the Nfkb2 gene in mTEC II cells (Fig. 2d-f and Extended Data Fig. 5). 
By contrast, mT'EC-IV-specific enhancers were significantly enriched 
(P=10~®) for the POU class 2 transcription-factor motif, correlat- 
ing with the specific expression of the Pou2f3 gene in mTEC IV cells 
(Fig. 2d-f and Extended Data Fig. 5). 
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Fig. 3 | Characterization of AIRE-dependent mTEC subsets. a, Total 
expression distribution of stochastically expressed genes across the TEC 
metacells. In the box plots, bars indicate median, boxes are the first- 
third quantiles, whiskers, 5th—-95th percentile and outlier are shown 

as circles. n = 2,341 single cells. b, c, Mean normalized expression of 
representative TRA genes (b), Aire and H2-Aa (c) across TEC metacells. 
d, Venn diagram depicting overlap between stochastically expressed 
genes and an established list of AIRE-dependent and -independent 
TRA genes. Hypergeometric test. n = 388 differentially expressed genes. 
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Fig. 2 | Genetic and epigenetic 
characterization of TEC subsets. a, Gating 
strategy for fluorescence-activated cell sorting 
to isolate mTEC subsets. b, qPCR analysis of 
mTEC subsets. Values represent fold change 
from the sample mean. n = 3 biologically 
independent animals. Data are mean (bars) 
and individual animals (dots). c, Normalized 
H3K4me2 profiles in 100-kb regions around 
differential mTEC gene loci. Differential 
genes are specified by black arrows. Plots 

are representative of four animals from two 
independent experiments. d, Peak intensities 
of 2,302 differential peaks between the four 
mTEC subsets (clusters 1-7; K-means; K =7) 
e, Transcription-factor motif analysis 
showing enrichment of motifs in accessible 
regions within H3K4me2-marked peaks; bars 
indicate motif abundance in H3K4me2 peaks 
clusters 1-7. False-discovery rate-corrected 
binomial test; n = 137 and 1,559 peaks. 

f, Projection of transcription factors onto the 
two-dimensional graph (Fig. 1). 


b 
0.20 Ins2 Cryaa 
_ 0.20 
= 0.10 
5 0.10 
8 | fl og) po tn ane 
% 0:30) gst 0.155 Card 
= 0.20 0.10 
0.10 0.05 
8 9] a1 _all Sects elisa 
om 
x 
® 93 Avp 1.2 lapp 
8 0.2 0.8 
= of 0.4 
04 —=—— Soe (¢)! | —ceemeeee es aE = 
Stochastic genes 
2.0 


CO 


AIRE aches AIRE we 
(P=8.7 x10" 


Csnb 


animals (dots). 


e, Comparison of stochastic gene expression between Aire knockout 


624 | NATURE | VOL 559 | 26 JULY 2018 


© 2018 Springer Nature Limited. All rights reserved. 


Percentage of population (%) 


< 


mTEC Il 


1.8 
1.6 
x14 
1.2 
1.0 
0.8 ; 
1.0 1.4 1.8 
WT 
ee 
e es 
e e 
) % i“ 
e e 
‘\ S \S NY 
.e) S S 
x Ka Ka Ko 
¢ < < 


(KO) and wild-type (WT) cells in TEC subsets. Marginal distributions 

are shown as histograms. Axes represent UMI count per 1,000 UMI, 
normalized to cell numbers. f, Bar plots showing log, fold change between 
TEC subpopulation abundances in Aire knockout and wild-type mice. 
Error bars represent 95% confidence intervals. n = 1,332 knockout and 
1,638 wild-type cells. g, Projection of Csnb onto the two-dimensional 
graph (Fig. 1). h, Percentage of tdTomato-expressing cells in mTEC I-IV. 
n=4 biologically independent mice. Data are mean (line) and individual 
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Fig. 4 | mTEC IV, a new TEC population 
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Given that one of the key functional roles of mTECs is to ectopi- 
cally express and cross-present a plethora of tissue-restricted antigens 
(TRAs)!, we analysed the level of TRA gene expression within the TEC 
compartment. Because the expression of most TRAs is stochastic and 
AIRE-dependent!", we first defined a list of stochastically expressed 
genes in mTECs, based on their high expression variance and low cor- 
relation to other genes (Fig. 3, Extended Data Fig. 6, Supplementary 
Table 4 and Methods). As expected, mTEC II cells expressed the highest 
number of variable and uncorrelated genes (Fig. 3a, b and Extended 
Data Fig. 6). Notably, mTEC III cells showed a high level of stochastic 
gene expression (Fig. 3a—c) in spite of low expression of AIRE. MARS- 
seq analysis of 1,332 CD45" EpCAM¢ single cells from Aire knock- 
out mice, validated that AIRE deficiency almost entirely eliminated the 
expression of the ‘stochastic’ genes within the mTEC II and IH popu- 
lations (87% and 67%, respectively; Fig. 3e and Extended Data Fig. 6). 
In addition, AIRE deficiency also resulted in a decrease in mTEC I and 
mTEC III populations (Fig. 3f). By contrast, mTEC I cells and cTECs, 
showed an increase in frequency, while the mTEC IV population was 
unaffected (Fig. 3f). In order to better understand the lineage relation- 
ships of the individual mTEC subsets we performed in vivo fate map- 
ping using Csnb“’Rosa26"""™"” reporter mice (Extended Data Fig. 6 
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and Methods). Whereas Csnb (also known as Csn2) expression was 
restricted to the mTEC II and III subsets (Fig. 3g and Extended Data 
Fig. 6), the tdTomato reporter was expressed in mTEC II, II and IV, 
but was absent from mTEC I cells (Fig. 3h and Extended Data Fig. 6). 
This suggests that while mTEC IV cells may be developmentally derived 
from the Csnb* mTEC II and/or III populations, or from a common 
ancestor, the mTEC I population is not. 

Molecularly, the mTEC IV population is distinct from the other 
mTEC subsets, including the chromatin state, gene-expression profile 
and lack of stochastic gene expression (Figs. 1-3). On the basis of these 
data, we hypothesized that it may have a different functional role. This 
was further supported by specific expression of several genes that are 
associated with a rare epithelial lineage that is found in the gut, known 
as tuft (brush) cells!4-!° (Extended Data Fig. 7). In order to validate 
whether mTEC IV cells represent a putative tuft cell type, we compared 
their transcriptional profile to intestinal tuft cells and to the different 
TEC populations. Notably, mTEC IV cells were more similar to intesti- 
nal Hpgds-tdTomato” tuft cells than to any of the TEC subpopulations 
(Fig. 4a). Specifically, mTEC IV cells and intestinal tuft cells shared a 
large number of regulatory factors and tuft cell-specific genes includ- 
ing Avil, 1125 and Pou2f3. However, they also showed differential gene 
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expression, including for Gp2 and Gnb3, which were only expressed 
in mTEC IV cells (Fig. 4b, c and Extended Data Fig. 7). Moreover, 
microscopy analysis of mITEC IV cells showed a typical tuft-like stain- 
ing pattern in samples from both mice and humans (Fig. 4d, e). Finally, 
deficiency of a master regulator of intestinal tuft cells, Pou2f3, resulted 
in the complete loss of the mTEC IV population without affecting the 
development of any other TEC population (Fig. 4f, g and Extended Data 
Fig. 8), suggesting that they represent a bona fide tuft cell population. 

Because the mTEC IV population is characterized by high and 
exclusive expression of IL-25 in the stroma of the thymus (Fig. 4b), 
we next studied their potential impact on thymic cells expressing the 
IL-25 receptor (IL17rb) (Fig. 4h). To this end, we first characterized 
3,500 thymic CD45TIL-25R°* cells using MARS-seq and found five 
different subpopulations with distinct transcriptional states, including 
several CD3* thymocyte subsets, as well as thymus-resident type-2 
innate lymphoid cells (ILC2s) (Fig. 4h-j and Extended Data Fig. 9). 
Notably, the loss of mTEC IV cells in Pou2f3" mice was accompa- 
nied by a significant increase in the thymic ILC2 population (Fig. 4k), 
whereas it had no significant impact on other thymic CD45TIL-25Rt 
cells or the main T cell subsets (Extended Data Fig. 9). The increase 
in the ILC2 (Lin- TCR~CD127"8"'Tbet- RORyt- GATA3*) subset in 
Pou2f3"- versus wild-type mice was further confirmed by conventional 
flow cytometry (Fig. 41 and Extended Data Fig. 9). 

In summary, our study provides a comprehensive atlas of the stromal 
populations in the thymus of adult mice and during differentiation. We 
uncover unexpected complexity and diversity in this compartment, 
including an mTEC population with tuft-cell characteristics. This study 
also highlights important qualitative differences between the embry- 
onic and postnatal thymus, which have, thus far, remained uncharac- 
terized. As such, it clarifies many past controversies and confusions 
about the molecular, developmental and functional characteristics of 
the previously characterized TEC subsets and TRA expression, ena- 
bling the field to progress forward on a stable and common molecular 
blueprint. Nevertheless, several important open questions remain to 
be addressed in the future, including other functions of the mTEC IV 
population in the thymus and the lineage relationship and origins of 
the individual TEC subsets. Specifically, whether the mTEC II, III and 
IV populations are derived from a common ancestor cell, or whether 
mTEC IV cells are derived from the mTEC II and/or III populations. 
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METHODS 


Data reporting. No statistical methods were used to predetermine sample size. 
The experiments were not randomized and the investigators were not blinded to 
allocation during experiments and outcome assessment. 

Mice. All mice were maintained under specific pathogen-free conditions at the 
Weizmann Institute’s animal facility and were handled in accordance with the 
guidelines of the Institutional Animal Care and Use Committee (25720316-2). 
Wild-type C57BL/6 (B6) mice were purchased from Harlan Laboratories. Aire 
knockout C57BL/6 mice were purchased from Jackson Laboratories. Sox9°C!” mice 
were generated as described previously’”. Pou2f3 knockout and Hpgds'@"*"° mice 
were maintained under pathogen-free conditions at the IGMM and IGF animal 
facilities (RAM). Csnb“* mice were generated using bacterial artificial chromo- 
some (BAC) recombination. In brief, homology arms corresponding to a sequence 
downstream to the ATG of the Csnb gene were cloned into a Cre-recombinase- 
coding plasmid (pConst-Cre; provided by G. Schutz), followed by homologous 
recombination of the cre coding sequence into the BAC containing the Csnb gene 
(RP23-218H23, CHORI). Flp recombination was then used to remove the antibi- 
otic resistance cassette. The final BAC was injected into fertilized BALB/c oocytes 
by the Weizmann transgenic core facility. Heterozygous mutants were backcrossed 
to C57BL/6 mice. Csnb’Rosa26""*” reporter mice were generated by breeding 
heterozygous Csnb"* mice with Rosa26'41™© mice (Jackson Laboratories). 
Human samples. Human thymus samples were obtained during the course of cor- 
rective cardiac surgery at Schneider Children’s Medical Center of Israel, following 
ethical approval (0781-16-RMC). An informed consent was signed by parents of 
the patients before obtaining the thymus. 

Ethical compliance. All animals were housed according to guidelines at the 
Weizmann Institute of Science and the IGMM and IGF animal facilities (RAM). 
All experimental procedures were approved by the Institutional Animal Care and 
Use Committee (IACUC), application number 25720316-2. 

All human experiments were done according to guidelines at the Weizmann 
Institute of Science and approved by Helsinki ethics committee, approval number 
0781-16-RMC. 

Isolation of mouse thymic stromal cells. Thymi from 4-8-week-old mice (unless 
otherwise stated, for embryonic or neonatal thymi) were placed into cold 1x PBS 
supplemented with 2% fetal bovine serum (FBS, Invitrogen). Thymi were chopped 
into small pieces and disintegrated by enzymatic digestion for 30-40 min in a 37°C 
water bath, using 0.3 mg ml! collagenase D (Roche, 1088858), 1 mg ml! dispase 
II (Roche, 04942078001) and 10 ng ml“! DNase I (Sigma-Aldrich, DN25) in RPMI 
supplemented with 2% FBS. Cells were then filtered through a 50-j1m mesh filter 
and washed with 5-10 ml MACS buffer (1 x PBS with 5 mM EDTA and 2% FBS), 
followed by centrifugation at 230g for 4 min. Percoll gradient density centrifugation 
was performed in order to enrich the stromal compartment. In brief, cells were 
resuspended in 2 ml of 1.115 g ml! isotonic Percoll (Sigma-Aldrich, P1644) and 
placed at the bottom of a tube. Subsequently, 1 ml of isotonic 1.065 g ml! Percoll 
and then 1 ml of 1x PBS were layered on top. The Percoll gradient was centri- 
fuged at 2,700 r.p.m., 4°C, with no deceleration for 30 min. The thymic stroma 
accumulated between the top and middle layers, and was collected and washed 
with MACS buffer and centrifuged at 230g for 4 min. Embryonic thymi were not 
subjected to Percoll separation, but were treated with red blood cell lysis buffer 
(15 mM ammonium chloride, 1 mM potassium bicarbonate, 10 tM EDTA in 
double distilled water, pH 7.3). 

Isolation of thymic haematopoietic cells. Thymi from 6-12-week-old mice were 
surgically removed and placed in PBS on ice. Thymi were trimmed of fat and 
connective tissues, and thymocytes were extracted into a single-cell suspension 
by pressing the thymic lobes against a 70-|1m cell strainer. Cells were washed in 
MACS buffer (1x PBS with 2% FBS and 5 mM EDTA pH 8.0). 

Isolation of intestinal epithelial cells. Mouse small intestines were isolated, 
flushed with PBS and incised along their length. The tissue was incubated in 
30 mM EDTA (Sigma-Aldrich) in HBSS pH 7.4 (Life Technologies) on ice, and 
transferred to DMEM (Life Technologies) supplemented with 10% FBS (Sigma- 
Aldrich). Vigorous shaking yielded the epithelial fraction that was then incubated 
with 100 tl of dispase (BD Biosciences) in 10 ml of HBSS, supplemented with 
100 j1l of DNase I at 2,000 Kunitz (Sigma). The single-cell preparation was obtained 
by filtration through a 30-j1m mesh and used for further staining. 

Antibodies and reagents for flow cytometry. APC-Cy7-EpCAM (118218), 
PE-Ly51 (108308), pacific blue-I-A/I-E (107620), PerCP-Cy5.5-CD45 (103132), 
FITC-CD34 (343603), APC-CD31 (102409), PE-Cy7-CD45 (103114), FITC- 
ITGB4 (123605), PE-ITGB4 (123610) and PE-IL-17RB (IL-25R,146305) were 
purchased from Biolegend; APC-LICAM (FAB5674R) was purchased from Novus; 
APC-GATA3 (560078), brilliant violet 650-ROR t (564722), biotinylated CD3 
(553060), biotinylated CD4 (553728), biotinylated CD8 (553029), biotinylated 
B220 (553086), biotinylated Ter119 (553672), biotinylated CD11c (553800), 
biotinylated Grl (553125), FITC-CD45 (553080), FITC-CD8 (553031), APC- 
Ter119 (557909), PE-Cy7-CD45 (552848), brilliant violet 711-CD4 (563726), 
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APC-CD8 (553035), horizon V500-CD44 (560780), brilliant violet 711-Tcrgd 
(563994) and horizon V450-Tcrb (560706) were purchased from BD; and PerCP- 
eFluor710-Ly6d (46-5974-80), PE-Cy7-CD127 (25-1273-82), PE-Tbet (12- 
5825-82), eFluor780-—CD4 (47-0042-82) and eFluor780-CD25 (47-025182) were 
purchased from eBioscience. The following materials were also used for FACS 
staining: 7-AAD Viability Staining Solution (Biolegend, 420404); Biotinylated 
Ulex Europaeus Agglutinin I (UEA I) (Vector laboratories, B-1065), PE-Cy7- 
streptavidin (405206, Biolegend), PerCP-Cy5.5-streptavidin (554064, BD), 
SYTOX (S34857, Invitrogen), FcR blocker (BE0307, Bio X Cell). 
Flow cytometry and sorting. Cells were stained in MACS buffer (1x PBS with 
2% FBS and 5 mM EDTA pH 8.0) with specific antibodies for 30 min at 4°C. For 
haematopoietic cells, to avoid non-specific binding of antibodies to Fc, receptors, 
anti-mouse CD16/CD32 monoclonal antibody (Bio X cell, BE0307 or Biolegend, 
10130) was added to the antibody mix. Following staining, cells were washed and 
resuspended in MACS buffer. Secondary staining with streptavidin was performed 
in a similar manner. After the wash, cells intended for intracellular staining were 
fixed and permeabilized using the eBioscience fixation/permeabilization kit 
according to kit instructions, and stained with antibodies of intracellular markers 
for 2 hat 4°C. Cells were sorted on a BD FACSAria Special Order Research Product 
(SORP) or BD FACSAria Fusion/II or II, or analysed on a BD LSRFortessa or 
BD FACSCantoll. Spectral overlap between fluorescent dyes was compensated 
using single-stained controls. Pre-gating was first done for live cells (in non-fixed 
samples) based on a 7-AAD, DAPI or SYTOX stain, followed by single-cell gating 
according to the FSC-A versus FCS-W plot. Data analysis was performed using 
FlowJo software (Tree Star Inc.). 

mTEC subsets (Fig. 2a) were sorted from CD45" EpCAMt cells using the fol- 
lowing sub-sorting gates: mTEC I: MHC-II'“ITGB4*L1CAM; mTEC II: MHC- 
II*Ly6d-; mTEC II: ITGB4- L1CAM Ly6éd*; mTEC IV: MHC-IE°*LICAM+. 
Single-cell index sorting. Isolated cells were single-cell sorted into 384-well cell 
capture plates containing 2 11 of lysis solution and barcoded poly(T) reverse- 
transcription primers for single-cell RNA-seq. Barcoded single-cell capture plates 
were prepared with a Bravo automated liquid handling platform (Agilent), as 
described previously*. To record marker levels of each single cell, the FACS Diva 7 
‘index sorting’ function was activated during single-cell sorting. 
Library preparation for single-cell RNA sequencing. Single-cell libraries were 
prepared as previously described‘. In brief, 384-well plates, which contained lysis 
buffer and barcoded reverse-transcription poly-T primers, were immediately 
spun down and placed on dry ice. For barcoding and reverse transcription, a 
reverse-transcription reaction mix was added and the plates were placed in a PCR 
machine set to the appropriate program. All barcoded samples were pooled, fol- 
lowed by addition of exonuclease to remove excess RT-PCR primers. A purification 
step using SPRI beads that bind cDNA and RNA was performed after this step, as 
well as after each of the following steps. The pooled single-stranded cDNA was 
converted to a double-stranded DNA using a designated kit, in order to perform 
in vitro transcription of RNA molecules. The template DNA was then removed 
using DNase, and the generated RNA was fragmented and ligated to barcoded 
Illumina adapters. Reverse transcription of this ligation product was done using 
primers specific for the Illumina adapters, and libraries of the resulting cDNA were 
generated and enriched by 12-15 PCR cycles. 
Low-level processing and filtering. All RNA-seq libraries (pooled at equimo- 
lar concentration) were sequenced using the Illumina NextSeq 500 at a median 
sequencing depth of 16,289 reads per single cell. Sequences were mapped to mouse 
genome (mm9), demultiplexed and filtered as previously described*"®, extracting a 
set of unique molecular identifiers (UMI) that define distinct transcripts in single 
cells for further processing. We estimated the level of spurious UMIs in the data 
using statistics on empty MARS-seq wells (median noise 2.8%; Extended Data 
Fig. 2). Mapping of reads was done using HISAT (version 0.1.6)!°; reads with mul- 
tiple mapping positions were excluded. Reads were associated with genes if they 
were mapped to an exon, using the UCSC genome browser as reference. Exons of 
different genes that shared genomic position on the same strand were considered 
a single gene with a concatenated gene symbol. Cells with less than 500 UMIs 
were discarded from the analysis. After filtering, cells contained a median of 1,711 
unique molecules per cell. 
Data processing and clustering. The MetaCell pipeline was used to derive inform- 
ative genes and compute cell-to-cell similarity, to compute KNN graph covers 
and derive distribution of RNA in cohesive groups of cells (or metacells), and 
to derive strongly separated clusters using bootstrap analysis and computation 
of graph covers on resampled data. The MetaCell package is described in detail 
in Supplementary Note 1. Default parameters were used unless otherwise stated. 

Clustering for Figs. 1, 2 was done on a combined set of cells from two sources: 
(1) 1,972 CD45~ thymic cells and (2) 1,542 CD45-EpCAM? thymic cells 
(Extended Data Fig. 2). Clustering resulted in 49 clusters. Clusters with increased 
expression of Hbb-b1, Trbc2 or C1qb, which are markers for red blood cells, T cells 
or macrophages, respectively (mean expression > 10 times the median across clus- 
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ters), were marked as contaminants and discarded from further analysis (Extended 
Data Fig. 3). We performed hierarchical clustering over the clusters structure and 
divided clusters into epithelial, endothelial and fibroblast groups by cutree. 
Mapping cells to an existing cluster model. Given an existing reference single-cell 
dataset and cluster model, and a new set of single-cell profiles, we extracted for each 
new cell the K (K= 10) reference cells with top Pearson correlation on transformed 
marker gene UMIs as described above. The distribution of cluster memberships 
over these K-neighbours was used to define the new cell reference cluster (by 
majority voting), and was applied for visualizing new cells by weighted average of 
the x and y coordinates of the clusters. 

Clustering of development TEC and comparison to the existing model. 
Clustering of TEC during embryonic and postnatal development (Fig. 3) was done 
ona combined dataset of 1,343 E14.5, 895 E18.5 and 836 6 days postnatal epithe- 
lial cells. Two-dimensional projection of the resulting clustering was produced 
by MetaCell. However, in order to maintain the structure of the two-dimensional 
projection in Fig. 2, while enabling the discovery of new transcriptional states, we 
computed the KNN structure of the combined developmental and mature TEC. 
Developmental clusters with more than 20% cells for which >20% of their neigh- 
bours are within the mature dataset were associated with the mature projection, 
and their two-dimensional coordinates were determined by their mature neigh- 
bours. All other developmental clusters were assigned their regular coordinates 
(Fig. 3a and Extended Data Fig. 6). 

For further refinement of the early TEC population (as in Fig. 3g), all develop- 
mental metacells with mean expression >0.25 times the median across clusters 
were included. 

Gene modules and cell cycle analysis. Identification of ribosomal, cell cycle or 
other broadly expressed gene modules was done by clustering genes in a downsam- 
pled UMI matrix (500 molecules per cell). We filtered genes with total molecule 
(UMI) count lower than 5 and variance-to-mean ratio lower than 1.2. Hierarchical 
clustering using Ward’s method was performed for detecting 30 fine-grained 
clusters. After removing clusters with mean Pearson intracorrelation lower than 
0.025, 26 gene modules were retained. Manual annotation of the gene clusters was 
performed. This resulted in the identification of 26 modules with 8-136 genes, 
among which the cell cycle module contained 73 genes. Ribosomal modules (86 
genes) were excluded from clustering analysis. Expression of cell cycle genes is a 
good indicator of proliferation”®. To determine proliferation status of genes, we 
examined the pooled normalized expression of the cell cycle module across genes. 
This measurement showed a bimodal distribution, correlating with the total UMI 
count of cells (Extended Data Fig. 4e). 

GO enrichment. Gene enrichment analysis was done using metascape software 
with mouse whole-genome backup. 

Analysis of stochastically expressed genes. In order to define stochastically 
expressed genes, genes with less than 50 total UMIs were discarded (list A). Of 
this list, genes for which the corrected variance was greater than 1 (by a linear fit 
of the variance to the mean expression) were defined as variably expressed (list 
B). Gene-to-gene Pearson correlations were then computed on the UMI values of 
genes from list B (normalized to cell size). In order to discard tightly correlated 
gene expression programs, only genes for which the third highest correlation with 
other genes was less than 0.25 were considered as stochastically expressed (list C; 
Supplementary Table 4). 

Anestablished list of TRA genes was taken from a previous publication”!. AIRE 
dependency of these TRA was determined by a twofold reduction in Aire knockout 
mice measured in the previous study”!. Both AIRE-dependent and -independent 
lists were intersected with list B for further analysis. 

Immunofluorescence staining of frozen thymic sections. Thymi from 4—8-week- 
old female mice were embedded in OCT compound (Tissue-Tek, Sakura) and 
frozen on dry ice. Cryostat sections (6 1m) were fixed with ice-cold acetone for 
10 min and incubated with primary antibody (anti- AIRE AF488-conjugated, 
04-150; Millipore) diluted in 1% BSA in PBS for 60 min at room temperature. 
Sections were washed three times with PBS and incubated with DAPI staining for 
10 min at room temperature, followed by three washes with PBS. 

Immunoflourescence staining of PFA-fixed frozen thymic sections. 
Thymi from 4-8-week-old wild-type, Sox9°FFP, Csnb“+ Rosa26'4" and 
Csnb"?- Rosa264!mato mice were isolated, cleaned and fixed for 3 h with ice-cold 
3.7% formaldehyde, followed by overnight incubation with cryoprotection solu- 
tion (3.7% formaldehyde, 30% sucrose in 1x PBS). Thymi were embedded in 
OCT and frozen on dry ice. Cryostat sections (6-7 |tm) were permeabilized and 
blocked with blocking buffer (TBS, pH 7.4, 5% goat serum or BSA, and 0.1% Triton 
X-100) for 30 min at room temperature. Sections were then incubated with primary 
antibodies diluted in blocking buffer overnight at 4°C. Primary antibodies used 
were as follows: anti- PSMB11 (pd021, MBL); anti-PIGR (AF2800, R&D Systems); 
anti-DCLK1 (ab37994, Abcam); APC-conjugated anti-ITGA6 (313615; Biolegend). 
Sections were washed twice with TBST (1x TBS supplemented with 0.1% Tween- 
20). Sections stained with unconjugated antibodies were incubated with a second- 


ary antibody (goat anti-rabbit AF555, Jackson Laboratories) for 60 min at room 
temperature. DAPI staining was performed for 10 min at room temperature. 
Immunoflourescence staining of paraffin-embedded thymic sections. Thymi 
from 4—8-week-old mice or 8-day-old male humans were fixed in 4% PFA for 
48 h, followed by embedding in paraffin. Subsequently, 5-j1m-thick sections were 
dewaxed in xylene and rehydrated in graded alcohol baths. Antigen retrieval was 
performed by boiling slides for 20 min in 10 mM sodium citrate buffer, pH 6.0. 
Nonspecific binding sites were blocked in blocking buffer (TBS pH 7.4, 5% goat 
serum and 0.1% Triton X-100) for 30 min at room temperature. Sections were 
incubated with primary antibodies diluted in blocking buffer overnight at 4°C. 
Primary antibodies used were as follows: anti- VILLIN (MAB1671, Millipore); anti- 
COX] (sc-1754, Santa Cruz Biotechnology); anti-DCLK1 (ab37994, Abcam). Slides 
were washed two times with TBST before incubation with fluorescent secondary 
antibodies conjugated to AF488, AF555, Cy3 or Cy5 (Jackson ImmunoResearch 
Laboratories) and Hoechst (Sigma-Aldrich) or DAPI in TBS-0.1% Triton X-100. 
Single-molecule RNA fluorescence in situ hybridization (FISH). Single- 
molecule FISH probe libraries consisting of 48 probes with a length of 20 bp were 
designed as previously described”, constructed and provided by Agentek. The 
following probe libraries were used: Avil, Sbsn coupled to Cy5, Epcam and Pigr 
coupled to AF594. RNA FISH was performed as previously described. In brief, 
thymi from 4-8-week-old female mice were isolated, cleaned and fixed for 3 h with 
ice-cold 3.7% formaldehyde followed by overnight incubation with cryoprotection 
solution (3.7% formaldehyde, 30% sucrose in PBS). Thymi were embedded in OCT 
and frozen on dry ice. Cryostat sections (5-8 jum) were air-dried and fixed again 
with 3.7% formaldehyde for 5 min following by 2 h incubation with 70% ethanol at 
4°C. Sections were rehydrated with 2x SSC, and treated with proteinase K in 2x 
SSC for 10 min at room temperature. Hybridization was performed overnight by 
20% formamide in 2x SSC with 0.1 ng jl! of the desired probes at 30°C. DAPI (to 
stain the nuclei) was added during the washes, and sections were incubated with 
fresh GLOX buffer (10 mM Tris pH 8.0 and 0.4% glucose in 2x SSC). 

Imaging and image analysis. Imaging was performed on Ultima Multiphoton 
Microscope, Nikon Eclipse TI-S fluorescence microscope, or Nikon-Ti-E inverted 
fluorescence microscope with 60x and 100~ oil-immersion objectives and a 
Photometrics Pixis 1024 CCD camera, using MetaMorph software (Molecular 
Devices) or NIS element software (Nikon). Image analysis was performed with 
ImageJ software. 

Indexing first chromatin immunoprecipitation followed by deep sequencing 
(iChIP-seq). iChIP-seq was prepared as previously described". In brief, cells were 
cross-linked for 8 min in 1% formaldehyde and quenched for 5 min in 0.125 M 
glycine before sorting. Cells were sorted using the described sorting strategy and 
frozen. Cell pellets were lysed in 0.5% SDS and sheared with the NGS Bioruptor 
Sonicator (Diagenode). Sheared chromatin was immobilized on 15 tl Dynabeads 
Protein G (Invitrogen) with 1.3 jg of anti-H3 antibody (Abcam). Magnetized 
chromatin was then washed with 10 mM Tris-HCl supplemented with 1x PI. 
Chromatin was end-repaired, dA-tailed and ligated with sequencing adapters 
containing Illumina P5 and P7 sequences. Indexed chromatin was pooled and 
incubated with 2.5 jug H3K4me2 antibody (ab32356, Abcam) at 4°C for 3 h and 
for an additional hour with protein G magnetic beads (Invitrogen). Magnetized 
chromatin was washed and reverse cross-linked. DNA was subsequently purified 
with 1.65 SPRI and amplified by PCR with 0.5 1M of forward and reverse prim- 
ers containing Illumina P5-rd1 and P7-rd2 sequences. Library concentration was 
measured with a Qubit fluorometer and mean molecule size was determined by 
TapeStation (Agilent). DNA libraries were sequenced on an Illumina NextSeq 500 
with an average of over 10 million aligned reads per replicate. 

ATAC-seq. To profile open chromatin, we used an assay of transposase-accessible 
chromatin following sequencing (ATAC-seq) as previously published”’, with mod- 
ifications as previously described". In brief, cell populations were sorted in 400 jul 
of MACS buffer (1 x PBS, 0.5% BSA, 2 mM EDTA) and pelleted by centrifugation 
for 15 min at 500g and 4°C with low acceleration and brake settings. Cell pellets 
were washed once with 1 x PBS and cells were pelleted by centrifugation using the 
previous settings. Cell pellets were resuspended in 25 1] of lysis buffer (10 mM Tris- 
HCl (pH 7.4), 10 mM NaCl, 3 mM MgChy, 0.1% Igepal CA-630) and nuclei were 
pelleted by centrifugation for 30 min at 500g, 4°C with low acceleration and brake 
settings. Supernatant was discarded and nuclei were resuspended in 25 \1l reaction 
buffer containing 2 11 of Tn5 transposase and 12.5 jl of TD buffer (Nextera Sample 
preparation kit from Illumina). The reaction was incubated at 37°C for 1 h. Then, 
5 ul of clean-up buffer (900 mM NaCl, 300 mM EDTA), 2 jl of 5% SDS and 2 pl 
of Proteinase K (NEB) were added and incubated for 30 min at 4°C. Tagmentated 
DNA was isolated using 2 SPRI beads clean-up. For library amplification, two 
sequential nine-cycle PCR runs were performed in order to enrich small tagmen- 
tated DNA fragments. We used 2 il of indexing primers included in the Nextera 
Index kit and KAPA HiFi HotStart ready mix. After the first PCR, the libraries were 
selected for small fragments (less than 600 bp) using SPRI clean-up. Then a second 
PCR was performed with the same conditions in order to obtain the final library. 
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Processing of iChIP-seq, ATAC-seq and chromatin peak calling. Reads were 
aligned to the mouse reference genome (mm9, NCBI v.37) using Bowtie aligner 
version 1.0.04 with best match parameters (bowtie -m 1-sam-best-strata -v 2). To 
identify regions of enrichment (peaks) from ChIP-seq reads of H3K4me2, we used the 
HOMER package makeTagDirectory followed by the findPeaks command with the 
histone parameter” and IDR filtering for reproducible peaks across replicates”. Union 
peaks file were generated by combining and merging overlapping peaks in all samples. 
Chromatin analysis. For clustering of differential peaks, we first averaged peak 
sizes across replicates. We defined differential peaks as peaks for which the max- 
imum value is more than fourfold higher than their minimum value. We then 
normalized peaks intensities and performed K means clustering (K=7). For motif 
finding, we independently called peaks in ATAC-seq, as above, and identified the 
maximum peak that overlapped each H3K4me2 region. The overlapping sequences 
were input for HOMER package motif finder algorithm findMotifGenome”>. 
Gene tracks and visualization. All gene tracks were visualized as bigWig files 
of the combined replicates normalized to 10,000,000 reads, using Integrative 
Genomics Viewer (http://www.broadinstitute.org/igv). 
Real-time PCR analysis. Cells were sorted into 40-50 1l of lysis/binding 
buffer (Life Technologies). mRNA was captured with 15 \1l of Dynabeads 
oligo(dT) (Life Technologies), washed and eluted at 85°C with 10 jl of 10 mM 
Tris-Cl (pH 7.5). The purified RNA was used for cDNA synthesis using 
the High-Capacity cDNA Reverse-Transcription kit (Applied Biosystems) 
and polyT primers. The subsequent qPCR analysis was performed using 
the Fast SYBR Green Master Mix (Life technologies). Differential expres- 
sion was calculated according to the AAC, method. Specific qPCR primers: 
Actb: GGAGGGGGTTGAGGTGTT, TGTGCACTTTTATTGGTCTCAAG; 
Sox4: GCTGGGCTTTCTCCTCCT, AGGCTGGCCTGCTACTCG; Aire: 
TGGGCTGATTAGGACCAAGA, ACAAAGATCAGGGCCATCTG; Avil: 
GCATCAGGACCCACATCTGC, ATGCTGTGGCACATGGTAGAG; Sbsn: 
CACCATGCCCTAAACTGATGC, ACAAAGCTCAAAGCAGCCCTC; cre: 
AGGAGAATGTGGATGCTGGGG, CAATTTCGGCAATGCGCAGC; Csnb: 
AAACTTCAGAAGGTGAATCTCATGG, GCTGGATGTTTTGTGGGACG. 
Lists of ribosomal and cell cycle modules. Ribosomal genes. 2810422JO5Rik, 
AC151602.1, AL663027.1, Cfll, Eef1b2, Ftll, Gm10059, Gm10076, Gm10443, 
Gm11361, Gm11808, Gm11942, Gm12630, Gm13408, Gm14456, Gm15427, 
Gm15459, Gm15710, Gm3788, Gm4149, Gm5244, Gm5559, Gm8730, Gm8759, 
Gnb2l1, Hspa8, Mif, Rpl10, Rpl13, Rpl13a, Rpl14, Rpl18a, Rpl22, Rpl26, Rpl28-ps3, 
Rpl29, Rpl32, Rpl34, Rpl37, Rpl37a, Rpl38, Rpl38-ps2, Rpl4, Rpl7a-ps12, Rpls, 
Rpl9-ps6, Rplp0, Rplp1, Rplp2, Rps10, Rps10-ps1, Rps14, Rps15, Rps18, Rps19, 
Rps2, Rps20, Rps21, Rps26, Rps28, Rps3, Rps3a, Rps5, Rps8, Rps9, Rpsa, Rpsa-ps10, 
Snord35a, Tmsb10, Tpt1, B2m, Eeflal1, Gas5, Gm13456, Gm15500, Gm16247, 
Rp123, Rpl35a, Rpl7, Rps11, Rps24, Rps25, $100a11, Tmsb4x, Ubl5, mmu-mir-703. 
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Cell cycle genes. 2810417H13Rik, 2900006K08Rik, 4833427G06Rik, 4930473A 06Rik, 
5133401NO9Rik, 6820408C15Rik, Arhgap11a, Arl6ip1, Birc5, Ccdc108, Ccdc113, 
Ccdc151, Ccdc17, Ccdc19, Ccdc30, Ccdc39, Ccdc40, Ccdc67, Ccno, Cdc20, Cdca8, 
Cenpf, Ckap2, Ckap2l, Cks1b, Dedc2a, Dek, Dnahc9, Dnajb13, E030019BO6Rik, 
Foxm1, Gm11423, Gm9938, H2afx, Histih1a, Hist1h1b, Hist1h1d, Hist1hle, 
Hist1h2ac, Hist 1h2ae, Hist1h2ao, Hist1h2ap, Hist1h3c, Hist1h3e, Hist1h4d, Hmgb2, 
Hnrnpa2b1, Ift46, Kif15, Kif24, Lrrc23, Lrrc46, Mki67, Phospho2, Pih1d2, Plk1, 
Rsph1, Rsph9, Smc4, Supt16h, Tekt1, Tekt4, Tmem107, Top2a, Ttll6, Tubalb, Tubb2c, 
Tubb5, Ube2c, Wdr52, Wdr65, Zmynd10. 

Reporting summary. Further information on experimental design is available in 
the Nature Research Reporting Summary linked to this paper. 

Code availability. No custom software was used to collect data. iChIP-seq and 
ATAC-seq data analyses, including motif finding, were done with the HOMER 
package”. Single-cell data were analysed with the MetaCell package, which is 
available upon request. 

Data availability. RNA, iChIP-seq and ATAC-seq data reported in this paper were 
deposited with Gene Expression Omnibus under accession numbers: GSE103967, 
GSE103968, GSE103969 and GSE103970. 
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Extended Data Fig. 1 | Single-cell data quality controls. a, Summary of detected in each cell (c), fraction of analysed cells from each amplification 
all single cells analysed in this study, divided into experimental procedures. _ batch (d) and estimation of technical noise for each amplification batch (e). 
‘nbatches’ indicates number of technical replicates; ‘ncells’ indicates Cells are coloured by experimental procedure. Technical noise is assessed 
number of cells after filtering (see Methods). b-e, Colour-coded tracks by genomic UMIs in empty wells as previously described* (see Methods). 


summarizing the number of Illumina reads per cell (b), transcripts (UMI) 
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Extended Data Fig. 3 | Thymic epithelial cells are characterized by 
four subsets of mTEC and a single cTEC subset. a, Heat map showing 

a metacell analysis of 2,341 thymic epithelial cells (CD45- EpCAM*), 
featuring the 73 most variable genes, from 15 biological replicates of 
4-6-week-old mice. Colour bar represents separation of 36 metacells into 
five main populations. b, Two-dimensional graph representation of the 
metacell model in Fig. 1a (see Methods). Big circles represent metacells, 
and are colour-coded as shown ina. c, FACS index sorting measurement 
of LY51 and UEA1 in epithelial cells. Cells are coloured based on cluster 
association as determined in a. Dashed lines outline LY51*UEA1~ and 
LY51-UEAI1* gates. d, Fraction of TEC subsets out of LY51*UEA1~ and 
LY51-UEAI* populations, assessed by gating single cells on index sorting 
protein measurements of UEA1 and LY51 inc. e, Controlling for batch 
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effect as determined by the relative share of each batch in all metacells. 
Batches are ordered by biological replicate (marked by dashed lines) and 
sorting scheme (either CD45~ or CD45 EpCAM”*). f, g, Single molecule 
FISH assay on 5-8-j1m-thick cryosections (see Methods) using fluorescent 
probes against the genes Epcam and Sbsn (f) or Avil (g). Blue, DAPI. The 
experiments were repeated independently four times with similar results. 
h, Immunofluorescence images of the protein markers: PIGR and DCLK1. 
Medulla (M) and cortex (C) are separated by dashed lines, distinguished 
by nuclei density. Blue, DAPI. f-h, Scale bars, 20 jtm. The experiments 
were repeated independently twice with similar results. i, Projection of 
representative differentially expressed genes onto the two-dimensional 
graph of epithelial cells. 
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Extended Data Fig. 4 | See next page for caption. 
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Extended Data Fig. 4 | TEC dynamics during thymus development. 

a, Flow cytometry scheme of thymic epithelial cells from different 
development time points. Numbers indicate fraction of CD45" EpCAM* 
cells. b, Summary of the K (K=50) nearest neighbours of embryonic 
cells, grouped into metacells. Neighbours of adult origin are coloured 

by TEC subsets, and of embryonic origin are coloured by developmental 
time point. Metacells with more than 20% adult neighbours were assigned 
to a TEC subset. c, Normalized mean expression of differential genes 
across TEC mature populations (4 weeks old) and unassigned cells from 
developmental time points (grey). n = 4 (E14.5), 3 (E18.5) and 2 (6 days) 
independent animals. Data are median (bars) and individual animals 
(dots). d, Differential gene expression between early cTEC (e-cTEC) from 
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three developmental time points and the mature cTEC. Axes represent 
UMI count per 1,000 UMI, normalized to cell numbers. e, Distribution 
of cell cycle gene expression across cells from developing TEC is bimodal. 
The red line indicates the empirical proliferation threshold. f, Frequency 
of proliferating cells in the epithelial population at each developmental 
time point. Colour code as in b. g, Gene pairwise Spearman correlation 
over 2,319 e-cTEC single cells reveals three gene modules jointly expressed 
across embryonic early TEC and mature cTEC populations. h, GO 
annotations enrichment analysis of the three cTEC gene modules. For 
cell cycle, n = 114; antigen presentation, n = 33; and antigen processing, 
n=56. 
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Extended Data Fig. 5 | Genetic and epigenetic characterization of TEC 


subsets. a, In silico gating of mTEC I-IV populations by index sorting 
measurements of surface markers. The same gating schemes were used 
to purify these populations by FACS (Fig. 2a). Cells are colour-coded as 
shown in Fig. 1. Blue, cTEC; light blue, mTEC I; red, mTEC II, yellow, 
mTEC III; green, mTEC IV. b, Relative enrichment (log) fold change 


compared to total CD45" EpCAM‘ epithelial cells) of the individual mTEC 


correlation of 29,472 H3K4me2 peaks (top) or ATAC-seq peaks (bottom) 


from mTEC I-IV sorted populations. Biological replicates for each 
population are shown. d, Scatter plots of mTEC I-IV H3K4me2 ChIP-seq 
peaks in biological duplicates. e, Summary of the most significant motifs 
enriched in each cluster of mTEC I-IV H3K4me2 differential peaks 

(Fig. 2e). P values are derived from binomial tests after FDR correction for 
multiple hypotheses. n = 2,302 differential peaks. 


I-IV subsets gated according to a. c, Heat map showing pairwise Spearman 
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Extended Data Fig. 6 | Characterization of AIRE-dependent mTEC 
subsets. a, Variance of genes plotted against their mean value (genes with 
>50 total UMI are shown). Orange dots indicate variable genes. 

b, Pairwise Pearson gene correlations in AIRE-dependent (left) and 
AIRE-independent (right) TRA gene lists across 2,341 TEC single 

cells. Levels of differential expression (highest change of expression in 
cluster compared to median across all clusters) are indicated as bars; bar 
colours indicate cluster association to TEC population. c, Comparison 
of stochastic gene expression between Aire knockout and wild-type 

cells in mTEC III and IV populations. Marginal distribution is shown 

as histogram. Axes represent UMI count per 1,000 UMI, normalized to 
cell numbers. d, Flow cytometry scheme of thymic Aire knockout cells 


showing the percentage of each TEC population compared to wild-type 
percentage (shown in brackets). e, Representative immunofluorescence 
images of two independent experiments, for tdTomato across different 
organs. In the thymus, the medulla (M) and cortex (C) are separated 

by dashed lines, distinguished by nuclei density. Blue, DAPI. Scale 

bars, 100 um. f, qPCR analysis of Csnb (left) and cre (right) genes in 
Csnb"** Rosa26'@ mo (cre+) and Csnb?’- Rosa26"4®"*"° (cre-) across 
thymic populations. Dot plots display mean and error bars indicate s.e.m. 
n=2 (wild type) or n=3 (Csnb“**) biologically independent animals. 

g, Flow cytometric analysis of tdTomato expression in mTEC subsets 
(colours as in Fig. 1) isolated from thymi of Csnb‘*+ Rosa26"*”™"_ or Csn 
b°*Rosa26'47™0_reporter mice. 
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Extended Data Fig. 7 | Comparing intestinal tuft cells and the mTEC 
IV population. a, Flow cytometry scheme of mTEC IV cells (Sox9- 
eGFP*L1CAM”*) sorting. b, Flow cytometry scheme of small intestine 
Hpgds-tdTomato* tuft cell sorting. c, Heat map showing gene expression 
profiles across 1,903 intestinal tuft (Hpgds-tdTomato*) single cells, 
grouped into 68 metacells. d, Comparison of gene expression between tuft 
cells isolated from small intestine (Hpgds-tdTomato*; x axis) and mTEC 
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IV cells isolated from thymus (CD45~ Ep>CAMt Sox9-eGFP*L1ICAM*; 

y axis). Axes represent UMI count per 1,000 UMI, normalized to cell 
numbers. e, Normalized mean expression of differential genes across 
TEC populations, sorted mTEC IV cells (LICAM™ Sox9-eGFP*) and 
intestinal Tuft (Hpgds-tdTomato‘) cells. f, GO annotations enrichment in 
differential genes (fold change >2) between intestinal tufts (n = 634) and 
mTEC IV cells (LICAM*Sox9-eGFP*) (n= 1,308). 
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Extended Data Fig. 9 | mTEC IV shape the thymus immune niche. 

a, Heat map showing a metacell analysis of 3,500 CD45*IL-25R* cells 
across five clusters. b, Differential gene expression in ILC2 cells from 
two biological replicates compared to other CD45tIL-25R’% cells (rest) 
from each replicate. Axes represent log, fold change. c, Percentages 

of CD45* cells in Pou2f3 knockout and wild-type thymi, determined 

by flow cytometry. Circles and diamonds indicate independent mice, 
centre line indicates the mean value. d, Flow cytometry analysis of cells 
expressing CD4, CD8, CD25 and CD44 in Pou2f3 knockout and wild- 
type thymi. The experiment was repeated independently four times with 
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similar results to confirm reproducibility. e, Flow cytometry sorting 
scheme of thymic CD45*IL-25R°* cells from Pou2f3 knockout and wild- 
type thymi. f, Percentage of CD45*IL-25R? cells in Pou2f3 knockout and 
wild-type thymi. Circles and diamonds indicate independent mice, centre 
line indicates the mean value. g, Percentages (left) and numbers (right) of 
ILC2 (Lin"-TCR~CD127* GATA3*Roryt_ ) cells within the single-cell gate 
in Pou2f3 knockout and wild-type mice. Circles and diamonds indicate 
independent mice, centre line indicates the mean value. A one-tailed 
Student’s t-test was used for the comparison, *P < 0.05. 
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Extended Data Table 1 | Regions of enhancer enrichment peaks 


a. 


number of iChiP aligned number of IDR 
reads peaks 
mTEC-I a 13831980 
mTEC-I b 3359743 aad 
mTEC-II a 32464771 
mTEC-II b 29099226 cai 
mTEC-III a 17032161 
mTEC-III b 17710805 yeere 
mTEC-IV a 18818509 
mTEC-IV b 17910967 ane 
b. 
Sample Replicate | number of ATAC aligned reads Number of peaks 
mTEC-I a 12521131 44288 
mTEC-I b 10665995 39040 
mTEC-II a 6626821 19008 
mTEC-II 8633041 27739 
mTEC-II 10899189 10525 
mTEC-III 2387938 19507 
| mTEC-IV | a 8388701 30017 
mTEC-IV b 13972276 38108 


a, H3K4me2 iChIP-seq peak calling. b, ATAC-seq peak calling. 
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Thymic tuft cells promote an IL-4-enriched 
medulla and shape thymocyte development 


Corey N. Miller!?, Irina Proekt!?, Jakob von Moltke***, Kristen L. Wells®, Aparna R. Rajpurkar*, Haiguang Wane®, 
Kristin Rattay’, Imran S. Khan!*"", Todd C. Metzger!?, Joshua L. Pollack*’, Adam C. Fries®, Wint W. Lwin!?, Eric J. Wigton’, 


Audrey V. Parent! 
Mark S. Anderson!?:3* 


The thymus is responsible for generating a diverse yet self-tolerant 
pool of T cells!. Although the thymic medulla consists mostly of 
developing and mature AIRE* epithelial cells, recent evidence has 
suggested that there is far greater heterogeneity among medullary 
thymic epithelial cells than was previously thought”. Here we 
describe in detail an epithelial subset that is remarkably similar 
to peripheral tuft cells that are found at mucosal barriers>. Similar 
to the periphery, thymic tuft cells express the canonical taste 
transduction pathway and IL-25. However, they are unique in their 
spatial association with cornified aggregates, ability to present 
antigens and expression of a broad diversity of taste receptors. 
Some thymic tuft cells pass through an Aire-expressing stage 
and depend on a known AIRE-binding partner, HIPK2, for their 
development. Notably, the taste chemosensory protein TRPM5 is 
required for their thymic function through which they support the 
development and polarization of thymic invariant natural killer 
T cells and act to establish a medullary microenvironment that is 
enriched in the type 2 cytokine, IL-4. These findings indicate that 
there is a compartmentalized medullary environment in which 
differentiation of a minor and highly specialized epithelial subset 
has a non-redundant role in shaping thymic function. 

Medullary thymic epithelial cells (mTECs) develop through a coordi- 
nated pathway in which AIRE-negative MHC-II precursors receive 
inductive signals, including RANKL and CD40L, mature into MHC- 
ThishCpgohish (mTECHs) ATRE- and tissue-specific antigen (TSA)- 
expressing cells, and promote central tolerance through negative 
selection and generation of thymically derived regulatory T cells”. 
We developed a lineage-tracing model in which Aire-expressing 
cells were inducibly labelled with a fluorescent reporter (Aire ERT. 
Rosa26CA© stepflox-tdTomato or AireTERT2. Rosq26CAG stopflox-zsGreen, inducible 
AIRE lineage trace, iALT)*. In iALT mice that were treated with a single 
dose of tamoxifen, four mTEC subsets could be distinguished based on 
MHC-II expression and REP, including a post-Aire (MHC-II'°*RFP*) 
population*. 

To investigate mTEC heterogeneity, we used fluorescence-activated 
cell sorting (FACS) to sort iALT-labelled mTECs and examined 
their transcriptomes (Fig. 1a, b). Bulk RNA sequencing (RNA-seq) 
revealed the highest levels of Aire and TSA transcripts in the early-Aire 
(MHC-II*g"REP’) and late-Aire (MHC-II>8RFPphi8h) subsets and 
unsupervised hierarchical clustering showed that these populations 
were the most related to one another (Extended Data Fig. 1a, b). In the 
post-Aire subset, two distinct transcriptional signatures emerged. The 
first was enriched for markers of the soft cornified epithelial pathway’, 
including the late-stage cytokeratin, Krt10 (Fig. 1b, c). This is consistent 


, Bruno Kyewski’, David J. Erle?, Kristin A. Hogquist®, Lars M. Steinmetz°, Richard M. Locksley*** & 


with the observation of cornified bodies within the human thymus, 
known as Hassall’s corpuscles, and recent reports of cornified markers 
within the mouse thymus*'°. Immunofluorescence analysis confirmed 
robust expression of KRT10 protein in the thymus of wild-type mice 
and confocal microscopy images showed the distinctive morphology 
of the medullary KRT10* structures (KRT10 bodies) (Extended Data 
Fig. 2a). 

The second transcriptional signature included genes that are asso- 
ciated with an epithelial subset called tuft cells!! (Fig. 1b, d). Recent 
reports have shown that these cells have a non-redundant chemosen- 
sory role in the intestine in which they orchestrate a feedforward loop 
driving the type 2 response to helminths and protozoa’”""*. Tuft cells are 
notable for their expression of the canonical taste transduction pathway 
(that is, Gnat3, Plcg2 or Plcb2 and Trpm5), choline O-acetyltransferase 
(Chat), doublecortin-like kinase 1 (Dclk1) and as the sole source of 
the barrier-associated cytokine, 1125*'° (Fig. 1d). The downstream 
cation channel, Trpm5, is required for tuft function in the intestine, 
but the upstream sensory receptor(s) remains unknown, although 
some peripheral tuft cells express a limited repertoire of type-II taste 
receptors from the bitter ligand family (TAS2R)'®!”. Flow cytometric 
analysis of mT ECs demonstrated that approximately 10% of mTECs in 
the thymus of adult C57BL/6 mice were DCLK1>"8" and immunoflu- 
orescence staining showed that DCLK1>"8t mTECs were distributed 
throughout the medulla (Fig. le and Extended Data Fig. 2a, b). These 
cells frequently had a bulbous morphology, narrow protruding base, 
and were often grouped into small multicellular clusters'> (Fig. 1f and 
Extended Data Fig. 2b). Unexpectedly, DCLK1>"8"* cells were closely 
associated with KRT10 bodies and quantitative image analysis con- 
firmed that they were significantly more likely to be adjoining KRT10 
surfaces than predicted by random chance (Fig. 1g and Extended Data 
Fig. 3a-d). In the human thymus, medullary DCLK1>"®" cells regularly 
abutted Hassall’s corpuscles, and were 3.5% of CD45" EPCAM* TECs 
(Extended Data Fig. 4a, b). 

Although the presence of CHAT and GNAT3, and the expression 
of several TAS2R family members have previously been reported in 
AIRE” mTECs by Panneck et al.'8 and Soultanova et al.!°, the impor- 
tance of these observations has been unclear. Therefore, we wanted 
to understand how closely related DCLK1>"8" mTECs were to intes- 
tinal tuft cells. We used the Flare25 IL-25-RFP reporter to confirm 
that DCKL1>""t mTECs expressed IL-25'. RFP* cells were dispersed 
throughout the medulla and colocalized with DCLK1, confirming tonic 
IL-25 production, and were approximately 7-10% of mTECs (Fig. 2a 
and Extended Data Fig. 5a, b). We then used the Flare25 reporter to sort 
small-intestinal enterocytes (RFP™ ), small-intestinal tuft cells (RFP*) 
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Fig. 1 | Tuft-like cells are closely associated with cornified 

bodies in the thymic medulla. a, Gating of mTEC subsets within 

CD11c~ CD45" EPCAM* thymic epithelial cells. Sorted in quadruplicate 
for RNA-seq (12 pooled thymi per replicate, n = 4 sorted replicates). 

b, Heat map of differentially expressed genes (false-discovery rate 

(FDR) < 0.01 and |fold change| > 8). ¢, d, Selected genes from regions 
marked ‘cornified’ (c) or ‘tuft’ (d). log, fold change in expression relative 
to mean expression levels. e, DCLK1 intracellular staining in mTECs. 


and thymic RFP* cells for bulk RNA-seq and compared expression 
of 20 tuft markers'!° (Fig. 2b). Of these, 18 were highly expressed in 
small-intestinal tuft cells and thymic REP* mTECs, whereas none were 
expressed in RFP small-intestinal enterocytes. Notably, only RFP* 
mTECs strongly expressed Gnat3, the G, subunit of gustducin and 
upstream mediator of the taste signal transduction pathway, whereas 
both intestinal and thymic RFP* populations expressed Trpm5?)? 
(Fig. 2b). FACS and RT-qPCR analysis confirmed that RFP* mTECs 
were the dominant source of 1/25 and Trpm5 mRNA (Extended Data 
Fig. 5c, d). Finally, DCLK1>tight mTECs were also observed to be 
KRT8/18*, consistent with peripheral tuft cells (Extended Data Fig. 5e). 
These data confirm that thymic DCLK1>"8" cells represent a novel 
subset of tuft cells. 

Despite shared core transcriptional features, we hypothesized that 
thymic tuft cells might differ from their peripheral counterparts in 
important ways, such as the ability to present antigens. Small-intestinal 
tuft cells expressed MHC-I and 8,-microglobulin, however, only thymic 
tuft cells expressed MHC-II genes and CD74 (Fig. 2c and Extended 
Data Fig. 5f-h). Expression of MHC-II on the surface of thymic tuft 
cells was confirmed by flow cytometry, although overall surface levels 
were lower than AIRE* mTECH'#" cells ( Figs. 1a, 2d). Whereas small- 
intestinal tuft cells did not express any of the known TAS2R family 
members, at the bulk RNA level, thymic tuft cells expressed a broad 
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array of these receptors (Fig. 2e and Extended Data Fig. 5i). When 
examined at the single-cell level, Tas2r genes tended to be coexpressed 
in single cells, although Tas2r expression was far more heterogeneous 
than Dclk1 or Trpm5 expression (Fig. 2f and Extended Data Figs. 6a, 
7b). When coexpression of Tas2r genes was compared to the back- 
ground gene set, many Tas2r gene pairs were significantly highly corre- 
lated (empirical P< 0.05), suggesting that expression is not random and 
subsets of thymic tuft cells may be coordinated in their chemosensory 
specificities (Extended Data Fig. 6a-c). Cells that expressed TAS2R 
family members also tended to express Gnat3 and coexpress 3-8 TAS2R 
family members (Fig. 2f and Extended Data Fig. 7b). These unique 
properties of thymic tuft cells raised the possibility that they participate 
in antigen presentation to developing thymocytes and may use their 
chemosensory abilities as part of their thymic function. 

Notably, Aire~/~ animals had normal numbers of thymic tuft 
cells’? (Fig. 3a and Extended Data Fig. 7a). After tamoxifen treat- 
ment for 10 days, thymi from iALT x Flare25 reporter mice had an 
RFP* single-positive population as well as a clear double-positive 
RFP*zsGreen* mTEC population, confirming that some thymic tuft 
cells pass through an Aire-expressing stage and also suggesting that an 
Aire-independent pathway exists (Fig. 3b). This is consistent with the 
thymic tuft signature appearing in both the pre- and post-Aire popula- 
tions (Fig. 1b). At the single-cell level, Aire~/~ thymic tuft cells showed 
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Fig. 2 | Thymic DCLK1>"8"* cells are a novel subset of tuft cells. 

a, Colocalization of DCLK1 and RFP (IL-25) in the thymus of a Flare25 
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expression relative to mean expression levels. c, Expression (normalized 
reads from b) of MHC-II genes in small-intestinal and thymic tuft cells. 
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surface staining on small-intestinal and thymic tuft cells. n =3 mice, 
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a similar pattern of expression of core markers and TAS2R family 
members as in wild-type cells (Fig. 3c and Extended Data Figs. 7b-e, 
8a, b). However, principal component analysis showed that the major- 
ity of Aire-/~ thymic tuft cells were mostly separated from wild-type 
cells (Fig. 3d). Notably, Gnat3 was a driver of the variation observed in 
principal component 1 and showed heterogeneous expression across 
both wild-type and Aire~’~ cells (Fig. 3d). This separation was sup- 
ported by quantitative image analysis of Aire ’~ thymic sections that 
had a marked increase in the frequency and size of tuft cell aggregates 
(Fig. 3e). Notably, there was a marked decrease in KRT10 bodies 
in the thymus of Aire’~ mice; this may contribute to the observed 
disorganization of the thymic tuft compartment in these mice”? (Fig. 3e, f 
and Extended Data Figs. 3c, 7f). 

HIPK family members mediate signal integration and cell-fate deci- 
sions during the differentiation of complex tissues. Hipk2 has been 
shown to be an AIRE-binding partner, however, mTEC-specific abla- 
tion of Hipk2 had only a small effect on mTEC™S" gene expression, 
including TSAs”“. Instead, there was decreased expression of a subset 
of genes within MHC-II'°’ mTECs, including the majority of tuft-as- 
sociated genes. As such, we hypothesized that HIPK2 might promote 
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differentiation of thymic tuft cells. Indeed, Foxn1°*Hipk2™" mice had 
a profound reduction in the frequency of MHC-II'°“DCLK1>"8"* cells 
with a concomitant increase in MHC-II°" and MHC-II""DCLK14¢™ 
cells, suggesting a developmental blockade (Fig. 3g, h and Extended 
Data Fig. 7g). Therefore, we describe a novel Hipk2-dependent mech- 
anism for tuft cell differentiation. 

Because tuft cells drive intestinal type 2 immune responses, we 
hypothesized that they might promote a type 2 microenvironment 
in the thymus. Previous work has shown that type 2 invariant natural 
killer T (NKT2) cells are the dominant source of interleukin-4 (IL-4) 
in the medulla”?°. We used the BALB/cBy] x Aire?™® C57BL/6 Fy 
model, characterized by increased numbers of thymic NKT2 cells 
and IL-4-dependent EOMEStCD8* single-positive thymocytes, 
and ablated post-Aire tuft cells with a nine-day course of diphthe- 
ria toxin?® (Extended Data Fig. 9a, b). This resulted in a selective 
reduction in the number of NKT2 cells and EOMEStCD8t 
single-positive thymocytes” (Extended Data Fig. 9c, d). Importantly, 
there was a marked decrease in IL-4 production by the remaining 
NKT2 cells (Fig. 4a). Because this model ablates both mTEC?86 and 
post-Aire mTECs, we exploited the requirement for the transcrip- 
tion factor Pou2f3 for peripheral tuft cell development”. As in the 
periphery, we found no thymic tuft cells in Pou2f3~’~ mice but a 
normal frequency of AIRE* mTECs, grossly normal thymic archi- 
tecture and normal numbers of double-negative (CD4 CD8-), 
double-positive (CD4*CD8") and single-positive (CD4* or CD8*) 
thymocytes (Fig. 4b and Extended Data Fig. 10a—c). However, there 
were fewer TCRB'™CD1d* invariant natural killer T ((NKT) cells and, 
as in the F; model, there was a marked decrease in NKT2 cells and 
EOMES'CD8* single-positive thymocytes (Fig. 4c, d and Extended 
Data Fig. 10d-f). Notably, Trpm5~/~ mice phenocopied the thymo- 
cyte perturbations of Pou2f3~’~ mice, demonstrating that Trpm5 is 
required for thymic tuft cell function (Fig. 4c, d). Finally, the num- 
ber of splenic NKT2 cells was selectively reduced in Pou2f3~’~ mice, 
mirroring the thymic defect (Fig. 4e and Extended Data Fig. 10g). 
Together, these data show that thymic tuft cells are critical for the 
development and intrathymic function of NKT2s, require compo- 
nents of the canonical taste chemosensory pathway and contribute 
to the availability of thymic IL-4. 

Given that thymic tuft cells express surface MHC-II, they could 
present self-antigens not covered by the mTEC™8* population. To 
directly test this, we used a thymic transplant model in which thymi 
from C57BL/6 or Pou2f3~’~ neonatal mice were transplanted into 
athymic Foxn1 ~~ nude mice, allowed to reconstitute, and immunized 
with IL-25 protein. Remarkably, only Pou2/3~’~ transplant recipi- 
ents, which had peripheral tuft cells but lacked antigen expression 
by thymic tuft cells, generated an IL-25-specific antibody response 
(Fig. 4f). Therefore, thymic tuft cells are required to enforce tolerance 
to a canonical tuft-restricted antigen and probably serve this purpose 
for other such proteins. 

Our findings reveal a previously undescribed microanatomical 
niche consisting of terminally differentiated cornified epithelial cell 
aggregates and thymic tuft cells. Thymic tuft cells are a novel tuft 
subset that have characteristics of both mTECs and peripheral tuft 
cells. Their development is dependent on the transcription factors 
Pou2f3 and Hipk2, a known AIRE-binding partner. We demonstrate 
a role for thymic tuft cells in the establishment of an IL-4-enriched 
medullary microenvironment through NKT2-mediated IL-4 produc- 
tion. Thymic tuft cells depend on Trpm5 for their function, strongly 
indicating that they respond to local environmental cues through 
their chemosensory abilities. In summary, our findings provide a 
novel source of antigen that is required for the establishment of tol- 
erance and an upstream mediator of thymic IL-4 production through 
an innate-like (iNKT) lymphoid pathway. The former extends our 
knowledge of an established role for the thymus in immune func- 
tion and the latter underscores an emerging role for adaptive effector 
cytokines beyond peripheral immune responses, including thymocyte 
development. 
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Fig. 3 | Aire is expressed by some thymic 

tuft cells but is not required for their 
development. a, DCLK1tight mTEC counts. 
n=5 mice per genotype, 3 independent 
experiments. b, Left, mTECs from iALT;Flare25 
double-reporter mice after 10-day tamoxifen 
treatment. Right, MHC-II (I-A>) surface 
expression within each gated population. n=5 
mice, 2 independent experiments. c, Single- 

cell RNA seq of RFP* (IL-25*) Aire~/~ thymic 
tuft cells. Heat map shows gene expression of 
selected tuft and Tas2r genes. Columns are 
single cells (n= 95), values are mean-centred 
log-normalized. d, Left, principal component 
analysis of single-cell RNA-seq data from RFP* 
tuft cells. Right, data from the left plot with 
Gnat3 expression overlaid. e, Left, DCLK1 
staining with signal converted to Imaris 
surfaces; volumes >5x mean single-cell volume 
(1 x 10* 1m?) are pseudocoloured according 

to size. Right, pooled data. n= 3 mice per 
genotype. f, KRT10 surface area in thymic 
medulla (per unit area). Three regions were 
analysed per thymus. n = 4 mice per genotype, 
2 independent experiments. g, h, Analysis 

of DCLK1>"8"* cells in Hipk2" controls or 
Foxn1*Hipk2™" thymus. g, Confocal imaging 
of immunofluorescence staining. h, Frequencies 
of DCLK1"8*MHC-II'*" or DCLK1*MHC- 
II" mTECs from flow analysis. n= 6 mice, 
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Fig. 4 | Thymic tuft cells promote an IL-4-enriched medullary 
microenvironment and enforce tolerance. a, Thymus cells from 
diphtheria toxin-treated Kn2 BALB/cBy] x Aire?!® C57BL/6 F, mice 
(n= 10 mice) or non-transgenic controls (n = 22 mice) gated on TCRB™ 
CD1d* iNKT cells. Right, counts of IL-4-producing (human (h)CD2*) 
NKT2s (PLZF*). WT, wild type. b, DCLK1 intracellular staining in 
mTECs. n=5 mice; 3 independent experiments. c, Thymic NKT2 
(TCRB™CD1d*PLZF*ROR t ) cell counts in C57BL/6 (B6) (n=15 
mice), Pou2f3~/~ (n= 19 mice) and Trpm5~/~ (n= 10 mice) mice. 

d, Thymic EOMES*TCR8*CD8* single-positive (SP) cell counts in 
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C57BL/6 (n= 10 mice), Pou2f3~/~ (n=11 mice) and Trpm5~/~ (n=9 
mice) mice. e, Splenic total iNKT and NKT2 cell counts in C57BL/6 (n=5 
mice) and Pou2f3~/~ (1=4 mice) mice. f, Serum anti-IL-25 autoantibody 
indices in nude mice transplanted with C57BL/6 (n =3 mice) or Pou2f3 ri 
(n= 12 mice) neonatal thymus and immunized with IL-25 protein. IMM, 
immunized. Dashed line, the average of C57BL/6 autoantibody index 
values plus three standard deviations. RLBA, radioligand binding assay. 


a, c-f, Data are mean 4 


t s.d. a, e, Unpaired, parametric, two-tailed Student's 


t-test. c, d, One-way, non-parametic ANOVA (Kruskal-Wallis test). 
a, c, d, Pooled from three independent experiments. 
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METHODS 

Mice. Aire~~, Aire’=®", Aire>T®, iALT, Flare25, Hipk2™, Trpm5~/—, Kn2 BALB/ 
cBy] and Foxn1°" mice have been described previously*!?***5°, Pou2f3/— 
(Pou2f3'"! (KOMP) Veg, Project ID VG18280) sperm was obtained from the DTCC- 
KOMP2 Consortium and animals were rederived by the UCSF Rederivation Core. 
Flare25 and Trpm5~/~ mice were provided by J.v.M. and R.M.L. (UCSE). C57BL/6 
(JAX 000664), BALB/cBy] (JAX 001026), C57BL/6Cg-Gt(Rosa)*6Sertm (CAG tdTomato) Hee jy 
(JAX 007914), C57BL/6Cg-Gt(Rosa)26Sor'" (CAC ZsGreen Hz) (7.4 X 007906) mice 
were obtained from The Jackson Laboratory*". 

Most mice were maintained in the University of California San Francisco 

(UCSF) specific pathogen-free animal facility in accordance with the guide- 
lines established by the Institutional Animal Care and Use Committee (IACUC) 
and Laboratory Animal Resource Center and all experimental procedures were 
approved by the Laboratory Animal Resource Center at UCSF. Some mice were 
maintained in the German Cancer Research Center (DKFZ) specific pathogen-free 
animal facility in accordance with the European Convention for the Protection of 
Vertebrate Animals used for Experimental and Other Specific Purposes and the 
German Legislation. Some mice were maintained at the University of Minnesota 
under specific-pathogen-free conditions in accordance with the guidelines estab- 
lished by the [ACUC. Mice aged 4-8 weeks were used for all experiments unless 
otherwise specified in the text or figure legends. Mice were age-matched in figures 
displaying a single representative experiment and in pooled data. Mice were sex- 
matched when possible. 
In vivo mouse treatments. Diphtheria toxin (Sigma-Aldrich) was administered to 
Aire?™® mice at a dose of 25-50 ng g | every other day via intraperitoneal injections 
for a total of five injections for all experiments. For tamoxifen treatment of mice 
possessing conditional alleles, tamoxifen (Sigma-Aldrich) was dissolved in corn 
oil (Sigma-Aldrich) and 2-mg doses were administered by oral gavage with flexible 
plastic feeding tubes (Instech). For continuous treatment, animals were gavaged 
every other day until the experimental end point. 

Mice were immunized with 100 jg of recombinant mouse IL-25 protein in PBS 
(R&D Systems) emulsified 1:1 v/v in CFA containing 4 mg ml“! heat-inactivated 
Mycobacterium tuberculosis strain H37a (Chondrex). A total of 200 \1l was injected 
subcutaneously, divided equally between both sides. Three weeks later, serum was 
collected for analysis and stored at —20°C until radioligand-binding analysis. 
Bulk RNA-seq sample preparation and analysis. Single-cell epithelial suspen- 
sions were isolated and stained as described below and then sorted using a FACS 
Aria III or Aria Fusion (BD Biosciences). Cells were sorted directly into DMEM 
(ThermoFisher) containing 10% FBS, pelleted and snap-frozen in liquid nitrogen 
before total RNA extraction. For lineage tracing, approximately 80,000 pre-Aire, 
150,000 early-Aire, 150,000 late-Aire and 25,000 post-Aire cells were collected 
for each biological replicate. For Flare25 RFP* small-intestinal and thymic tuft 
cells, approximately 35,000 small-intestinal RFP* and 1,750 thymic RFP* cells 
were collected for each biological replicate. Intact mRNA was isolated using the 
Dynabead mRNA Purification Kit for total RNA (ThermoFisher) and amplified 
cDNA was prepared using the NuGen Ovation RNA-Seq system V2 kit, accord- 
ing to the manufacturer’s protocol (NuGen Technologies). Sequencing libraries 
were generated using the Nextera XT library preparation kit with multiplexing 
primers, according to manufacturer's protocol (Illumina). Library fragment-size 
distributions were assessed using the Bioanalyzer 2100 and the DNA high-sensitivity 
chip (Agilent Technologies). Library sequence quality was assessed by sequenc- 
ing single-end 50-bp reads using the Illumina MiSeq platform and libraries were 
pooled for high-throughput sequencing on the Illumina HiSeq 4000 by using equal 
numbers of uniquely mapped reads (Illumina). Then, 12 samples per lane were 
multiplexed to ensure adequate depth of coverage. The analytic pipeline included 
demultiplexing raw sequencing results, trimming adaptor sequences, and aligning 
to the reference genome. Sequence alignment and splice junction estimation was 
performed using the STAR (https://code.google.com/p/rna-star) software program. 
For differential expression testing, the genomic alignments were restricted to those 
that map uniquely to the set of known Ensembl IDs (including all protein-coding 
mRNAs and other coding and noncoding RNAs). STAR-aggregated mappings on 
a per-gene basis were used as raw input for normalization by DESeq2 software. 
Single-cell RNA-seq and computational analysis. Single-cell epithelial sus- 
pensions were isolated and stained as described below and sorted using a FACS 
Aria Fusion (BD Biosciences). Single cells were sorted into individual wells of a 
96-well plate (Bio-Rad) containing 5 1] of lysis solution and sequencing librar- 
ies were prepared using the smart-seq2 method as reported with the following 
modifications*”. Batch one, 0.1 jl of a 1:1,000,000 dilution of ERCC Spike-In Mix 
(Life Technologies) in RNase-free water was included in the reverse transcrip- 
tion master mix. Batch 2, 1 jl of a 1:1,000,000 dilution of ERCC Spike-In Mix 
(ThermoFisher) in RNase-free water was included in a total volume of 5 11 lysis 
buffer. For both batches, 21 cycles of initial PCR amplification were performed 
and a ratio of 0.56:1.0 (beads:total PCR volume; instead of 1.0:1.0) of Ampure 
XP beads (Beckman Coulter) was used for the first PCR purification to minimize 


primer dimer carryover. After the first PCR amplification, cDNA libraries were 
screened via quantitative PCR (we used a 1:10 dilution of purified cDNA libraries 
for quantitative PCR) for expression of a mouse housekeeping gene (Ubc), and the 
distribution of library size was checked on a Bioanalyzer instrument (Agilent). 
Only cDNA libraries that passed both quality controls were processed further. 
Subsequently, 50 pg of cDNA was used for the ‘tagmentation’ (transposase-based 
fragmentation) reaction and 12 cycles were applied for the final enrichment PCR. 
The final purification step was performed with a ratio of 0.6:1.0 (as above) of 
Ampure XP beads (Beckman Coulter). 

In batch one, 96 cells were initially ‘multiplexed’ with 48 samples per Illumina 
HiSeq 2500 lane. Then, 31 additional cells were multiplexed with 96 samples per 
Illumina HiSeq 2500 lane and 58-bp single-end sequencing was performed. We 
observed no batch effect between these two sets. In batch two, 92 Flare25 cells and 
96 Flare25 Aire~/~ cells were multiplexed with 96 samples per Illumina HiSeq 2500 
lane. A negative control and a 50-cell positive control were sequenced from each 
sample. Flare25 and Flare25 Aire~/~ cells were split evenly between sequencing 
lanes and 85-bp single-end sequencing was performed. HiSeq sequencing lanes 
typically yielded between ~150 x 10° and ~200 x 10° reads for both batches. 

Raw data reads were aligned to the GRCm38 build of the Mus musculus genome 
using the STAR alignment software*’. Cells were required to have a minimum of 
40% total reads mapping to the genome, which all of our cells met. Reads uniquely 
aligned to mRNA genes were counted with featureCounts, a part of the Subread 
software package**. Genes present in a minimum of five cells and with at least 10 
reads were retained in the analysis. Cells expressing fewer than 750 genes with 10 
reads were removed from the analysis. All cells were required to have a minimum 
of 100,000 total reads mapping to genes, and less than 10% of reads mapping to 
mitochondrial genes. This resulted in 197 Flare25 cells and 95 Flare25 Aire~’~ cells 
for further downstream analysis. 

Data were normalized using the indeXplorer pipeline (https://git.embl.de/ 
velten/indeXplorer)*. In brief, data were fitted to error models using the SCDE 
software to model dropout events and overdispersion of genes and the posterior 
odds ratio was used as a normalized measure of gene-expression values***®. 

The effect of mitochondrial read counts were regressed out before all further 
downstream analysis. Principle component analyses were performed on the total 
set of cells, including both Flare25 and Flare25 Aire~’~ cells, using the ‘preomp’ 
function from the ‘stats’ R package. Heat maps were generated by calculating 
the mean centred expression values for each gene across all cells and clustered 
within genotype (independently for Flare25 and Flare25 Aire~’~) using hierarchi- 
cal clustering with Ward linkage (‘hclust’ from the ‘stats’ R package). Genes were 
ordered by hierarchical clustering with Ward linkage across all cells. Histograms 
were generated by counting the number of cells with the associated number 
of Tas2r genes present or not present (defined as normalized expression value 
greater than 0). Correlation values between Tas2r gene pairs were calculated using 
Spearman's rank correlation. Statistically significant correlations were established 
by constructing an empirical cumulative distribution function for all pairwise 
correlation values for all genes. Pairwise correlation values falling within the high 
correlation cutoff of 0.05 were determined significantly high by an empirical 
P value of P< 0.05. 

Immunofluorescence staining and imaging. For immunofluorescence, tissues 
were fixed in 2% paraformaldehyde (Pierce) in PBS for 2 h at 4°C followed by 
overnight incubation in 30% (w/v) sucrose (Sigma-Aldrich) in PBS. Tissues were 
embedded in Optimal Cutting Temperature Compound (Tissue-Tek) and stored 
at —80°C before sectioning (50-200 jm) on a cryostat (Leica). Thin sections were 
dried on Superfrost Plus (Fisher Scientific) slides and semi-thick (200 jm) sections 
were moved directly to 0.3% Triton X-100 (Sigma-Aldrich), 0.2% BSA (Sigma- 
Aldrich), 0.1% sodium azide (Sigma-Aldrich) in PBS (Immunomix). Slides were 
stained in a humidified chamber and semi-thick sections were stained in 24-well 
plates with one section per well. Slides were briefly rehydrated in PBS before per- 
meabilization in Immunomix for 1 h at room temperature followed by blocking 
with BlockAid (ThermoFisher), primary antibody staining at room temperature, 
and, when necessary, secondary antibody staining at room temperature for 1 h. 
Semi-thick sections were permeabilized in Immunomix with shaking at room tem- 
perature overnight followed by blocking with BlockAid at room temperature for 
2h, primary antibody staining at room temperature for 2 h, and, when necessary, 
secondary antibody staining at room temperature for 2 h. Semi-thick sections were 
then moved to Superfrost Plus slides and all sections were mounted with ProLong 
Diamond Antifade Mountant (ThermoFisher). Images were acquired on a Leica 
SP5 (Leica) laser scanning confocal microscope. The following antibodies were 
used in this study: DCLK1 (Abcam polyclonal ab31704), KRT10 (EP1607IHCY), 
KRT5 (EP1601Y), KRT8 (EP1628Y), tdTomato (632496). Antibodies were pur- 
chased from Abcam or Clontech. 

Human thymic tissue. Thymus samples were collected from paediatric patients 
undergoing corrective cardiothoracic surgery. All experiments were performed 
in accordance with protocols approved by the UCSF Human Research Protection 
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Program Institutional Review Board and written informed consent was obtained 
before sample collection. 

Immunohistochemistry. Tissue was fixed in 4% paraformaldehyde 
(ThermoFisher) for 3 h at 4°C, washed with PBS, embedded in paraffin and sec- 
tioned. Antigen retrieval was performed on rehydrated tissue by boiling sections 
in antigen retrieval Citra Solution (Biogenex). Sections were blocked for 30 min 
at room temperature using CAS-Block (ThermoFisher) with 0.2% Triton X-100 
(Sigma-Aldrich), followed by incubation with anti-DCLK1 primary antibody 
(Abcam, ab31704) overnight at 4°C. Staining with biotinylated secondary antibody 
was performed for 1 h at room temperature. Slides were developed using an ABC 
kit (Vector labs) and DAB kit (Vector labs) and counterstained with haematoxylin. 
Image processing and quantitative image analysis. The spatial distribution of 
tuft cells was evaluated using the Spatstat R library. Raw Leica image files of three 
large scans (3,670 x 3,670 x 77 {1m) were imported into Imaris 8.3.1 (Bitplane). 
Intensity-based three-dimensional segmentation was then applied to 12 identi- 
cally sized regions of interest (586 x 272 x 77 1m) using identical thresholding 
parameters to generate surface coordinates of KRT10 bodies and centre of intensity 
coordinates of tuft cells. These coordinates were flattened to two dimensions (xy) 
and duplicates were removed. Two-dimensional coordinates were imported into 
R for spatial analysis and the relationship between surfaces and centre of intensity 
coordinates was calculated using the multi-type pair correlation function (PCF) 
in R, pcfcross (http://spatstat.org/). The pair-correlation function is defined as 
g=K'(r)/2a1, where K’(r) is the first derivative of Ripley’s K-function. Because 
PCE analysis is sensitive to spatial inhomogeneities within a field, the 12 ROIs were 
selected to avoid inhomogeneities at the field margins. 

To measure the range of cluster sizes of the tuft cell aggregates, raw Leica 
images of fixed thymic slices were analysed in the Imaris v.8.3.1 software package 
(Bitplane). Intensity-based three-dimensional segmentation was applied to the 
image voxels and volumes of the tuft cells were measured. Surface volumes <5 
times the mean single-cell volume (1 x 10* um?) were shown in green and surface 
volumes >5 (1 x 10* 1m) were pseudocoloured based on aggregate size. 
Single-cell tissue preparation. Mouse thymi were isolated, cleaned of fat and 
transferred to DMEM (UCSF Cell Culture Facility) containing 2% FBS (Atlanta 
Biologics) on ice. Single-cell suspensions of thymocytes were prepared by mashing 
through a 40-\.m filter (Falcon). For mTEC preparations, thymi were minced with 
a razor blade. Up to five thymi were pooled into a single digestion. Tissue pieces 
were moved with a glass Pasteur pipette to 15-ml tubes and vortexed briefly in 
10 ml of medium. Fragments were allowed to settle before removing the medium 
and replacing it with 4 ml of digestion medium containing 2% FBS, 100 jg ml! 
DNase I (Roche) and 100 jig ml”! liberase TM (Sigma-Aldrich) in DMEM. Tubes 
were moved to a 37°C water bath and fragments were triturated through a glass 
Pasteur pipette at 0 min and 6 min to mechanically aid digestion. At 12 min, tubes 
were spun briefly to pellet undigested fragments and the supernatant was moved 
to 20 ml of 0.5% BSA (Sigma-Aldrich), 2 mM EDTA (TekNova) in PBS (MACS 
buffer) on ice to stop the enzymatic digestion. This was repeated twice for a total 
of three 12-min digestion cycles, or until there were no remaining tissue frag- 
ments. The single-cell suspension was then pelleted and washed once in MACS 
buffer. Density-gradient centrifugation using a three-layer Percoll gradient (GE 
Healthcare) with specific gravities of 1.115, 1.065 and 1.0 was used to enrich for 
stromal cells. Cells isolated from the Percoll-light fraction, between the 1.065 and 
1.0 layers, were then resuspended in MACS buffer and counted. 

Human thymus was transferred to RPMI (ThermoFisher) and cut into small 
pieces using scissors. Tissue pieces were mashed gently using the back of a ster- 
ile syringe to extract most of the thymocytes. The supernatant was removed and 
replaced with fresh RPMI. Tissue pieces were moved to a 15-ml tube using a 5-ml 
pipette. Fragments were allowed to settle before discarding the medium and replac- 
ing it with 5 ml of digestion medium containing 100 jpg ml~! DNase I (Roche) and 
100 pg ml“! Liberase TM (Sigma-Aldrich) in RPMI. Tubes were moved to a 37°C 
water bath and fragments were triturated through a 5-ml pipette at 0 min and 
6 min to mechanically aid digestion. At 12 min, tubes were spun briefly to pellet 
undigested fragments and the supernatant was discarded. Fresh digestion medium 
was added to remaining fragments and the digestion was repeated using a glass 
Pasteur pipette for trituration. Supernatant from this second round of digestion 
was also discarded. A third round of enzymatic digestion was performed using 
5 ml of digestion medium supplemented with trypsin-EDTA (ThermoFisher) for 
a final concentration of 0.05%. Remaining thymic fragments were digested for an 
additional hour with trituration every 15 min. The cells were moved to cold MACS 
buffer to stop the enzymatic digestion. TECs were enriched and prepared for flow 
cytometry using the same protocol as mouse mTECs with the following modifica- 
tions: blocking was done with human Fc Receptor Binding Inhibitor Monoclonal 
Antibody (eBioscience) and human-specific antibodies against EPCAM, CD45 
and HLA-DR were used for surface staining. 

Small intestines were flushed with PBS, opened and rinsed with PBS to 
remove luminal contents. Subsequently, 2.5-5-cm-long segments of jejunum 
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were incubated with rocking for 20 min at 37°C in 5 ml PBS containing 2.5 mM 
EDTA (Sigma-Aldrich), 0.75 mM dithiothreitol (Sigma-Aldrich), and 10 jg ml“! 
DNasel (Sigma-Aldrich). Tissues were shaken vigorously for 30 s and released 
cells were incubated with rocking for 10 min at 37°C in 5 ml HBSS (Ca?*/Mg?* 
free) containing 1.0 U ml”! dispase (Gibco) and 10 ppg ml“! DNasel. Digested 
cells were passed through a 70-|1m filter and washed once before staining for flow 
cytometry. 

Flow cytometry and antibodies. For surface staining, single-cell suspensions were 
prepared as described and incubated with Live/Dead Fixable Blue Dead Cell Stain 
(ThermoFisher) in PBS for 20 min at 4°C followed by blocking with anti-mouse 
CD16/CD32 (24G2) (UCSF Hybridoma Core Facility) and 5% normal rat serum 
for 20 min at 4°C. Cells were then washed in FACS buffer and stained for surface 
markers for 30-45 min at 4°C. For CD1d tetramer staining, thymocytes were 
incubated with BV421-conjugated PBS-57-loaded CD1d tetramer (NIH Tetramer 
Core) for 1 h at room temperature. DCLK intracellular staining of mTECs was per- 
formed as described previously". For staining of intracellular transcription factors, 
thymocytes were fixed and permeabilized with FoxP3 staining buffer set (eBio- 
science) according to the manufacturer’s instructions. Flow cytometry data were 
collected on a LSRII Flow Cytometer (BD Biosciences) housed within the UCSF 
Single Cell Analysis Center, and analysed using FlowJo 10.3 software (TreeStar 
Software). The following antibodies were used in this study: DCLK1 (Abcam pol- 
yclonal ab31704), DCLK1 (EPR6085) Ly51 (6C3), CD11c (N418), CD45 (30-F11), 
EpCAM (G8.8), AIRE (5H12), I-A? (25-9-17), I-A‘ (39-10-8), TCR3 (H57-597), 
CD4 (GK1.5), CD8 (53-6.7), FoxP3 (MF-14), CD25 (3C7), PLZF (Mags21F7), 
T-bet (4B10), RORY (Q31-378), EOMES (Dan11mag), CD2 (S5.2) human EPCAM 
(HEA-125), human CD45 (HI30), human HLA-DR (L243). Antibodies were pur- 
chased from Abcam, BioLegend, BD Biosciences, eBioscience or Miltenyi. 
Quantitative RT-PCR. Single-cell epithelial suspensions were isolated and stained 
as described above and then sorted using a FACS Aria Fusion (BD Biosciences). 
Cells were sorted directly into TRIzol LS (ThermoFisher Scientific) and the aque- 
ous phase was isolated using Phasemaker Tubes (ThermoFisher) and RNA was 
isolated using the Micro Plus RNeasy kit (Qiagen) and reverse transcribed using 
the SuperScript IV Vilo Master Mix with ezDNase (ThermoFisher). The resulting 
cDNA was used as template for quantitative PCR with Taqman Gene Expression 
Assays on a 7500 Fast Real-Time PCR System (Applied Biosystems). Transcripts 
were normalized to Actb (Mm00607939_s1) expression. Aire (Mm00477461_m1), 
Dclk1 (Mm00444950_m1), Epcam (Mm00493214_m1), 1/25 (Mm00499822_m1) 
and Trpm5 (Mm01129032_m1) primers were used. 

Radioligand-binding assay. Full-length mouse 1/25 cDNA (MMM1013- 
211691449, Dharmacon) was transcribed, in vitro translated and biosyntheti- 
cally labelled with *°S-methionine using the TNT Quick Coupled Transcription/ 
Translation System (Promega). Radiolabelled protein was immunoprecipi- 
tated with serum samples loaded in triplicate in 96-well PVDF filtration plates 
(Millipore). Radioactivity retained in the filter was measured using a liquid scin- 
tillation counter (1450 MicroBeta Trilux; PerkinElmer). A polyclonal goat anti- 
mouse IL-25 antibody (R&D Systems, af1399) was used as a positive control. IL-25 
autoantibody indices for samples were calculated as: (c.p.m. of unknown - c.p.m. of 
negative standard)/(c.p.m. of positive standard - c.p.m. of negative standard) x 100. 
The limit of detection was defined as the mean of autoantibody indices for all wild- 
type samples plus three standard deviations. 

Statistical analysis. All experiments were performed using randomly assigned 
mice without investigator blinding. No data were excluded. Statistical significance 
between two groups was calculated using an unpaired, parametric, two-tailed 
Student's t-test. Where more than two groups were plotted together, statistical 
significance was calculated either by using one-way, non-parametic ANOVA 
(Kruskal-Wallis test) or using two-way non-parametric ANOVA with multiple 
comparisons. Experimental groups included a minimum of three biological repli- 
cates. Intragroup variation was not assessed. All statistical analysis was performed 
using Prism 7 (GraphPad Software), except where stated above. Figures display 
mean + s.d. A P value of less than 0.05 was considered statistically significant. No 
statistical methods were used to predetermine sample size. 

Reporting summary. Further information on experimental design is available in 
the Nature Research Reporting Summary linked to this paper. 

Code availability. Code used to align, perform quality control and analyse 
single-cell RNA-seq data can be found at https://github.com/mTEC-pipelines/ 
tufts-pipelines. Bulk RNA-seq and quantitative image analysis pipelines are 
described and code is available upon request or through publicly available GitHub 
repositories. 

Data availability. RNA-seq data that support the findings of this study have been 
deposited in the GEO database under accession numbers GSE114651, GSE114668 
and GSE114895. Source Data for Figs. le, 2c, 3e, f, h, 4c-f and Extended Data 
Figs. 5d, f-i, 9b-d, 10b-e, g are provided in the online version of the paper. All 
other data that support the findings of this study are available from the correspond- 
ing author upon reasonable request. 
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Heatmap of DE (FDR < 0.01, IFCI > 8) 
(union of all comparisons) 


Pre Post Late Early 
Extended Data Fig. 1 | RNA-seq of FACS-sorted iALT mTEC subsets. Columns are organized by unsupervised hierarchical clustering (union 
a, Heat map of Aire and representative Aire-dependent TSAs in pre-, of all comparisons) with a dendrogram representing similarity between 
early-, late- and post-Aire populations as defined in Fig. 1a. b, Heat map clustered columns. Note that the early- and late-Aire-expressing 
of differentially expressed genes with a FDR < 0.01 and [fold change| > 8. populations are grouped, as expected. 
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Extended Data Fig. 2 | Medullary localization of DCLK1>"#"* cells and b, Left, KRT5 (red) and DCKLI (green). Right, DCLK1] alone. Region was 
KRT10 bodies. Representative confocal maximum projections (25 ,1m) of __ selected for presence of multicellular DCLK1>"8" cell clumps as indicated 
immunofluorescence staining of thymic slices at low magnification. by white arrows. Scale bars, 100 jum. n =3 mice; images are representative 
a, KRT5 (red) and KRT10 (green) (left) and KRT5 (red) and DCLK1 of three independent experiments. 

(green) (right) showing medullary localization of KRT10 and DCLK1. 
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Extended Data Fig. 3 | Non-random distribution of medullary 
DCLK1>ti8ht cells. a, Stitched confocal maximum projection (77 jm) of 
immunofluorescence staining of semi-thick (200 1m) thymic slice from a 
C57BL/6 mouse at low magnification. KRT10 (red) and DCLK1 (green). 
Field, 3,670 x 3,670 jm. Small rectangles indicate volumes selected for 
quantitative image analysis from this slice. b, Expanded area of image from 
a indicated in lower left corner with KRT10 signal converted into surfaces 
and DLCK1 signal converted to centre of intensity coordinates. Scale 

bar, 100 um. ¢, Pair correlation function (PCF) analysis of 12 identically 
sized regions of interest (586 x 272 x 77 xm) from three different stitched 
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Imaris surfaces 


3.5 


Poisson Model Observed Distribution 


thymic slices (n = 3 thymic slices). Results are presented as a histogram 

for a range of distances from KRT10 surfaces and are interpreted as 
follows: g= 1 indicates a spatial distribution that follows a random Poisson 
distribution, illustrated by a dashed line and grey envelope; g < 1 indicates 
regularity in the distribution; g> 1 indicates clustering. d, Schematic 
representation of the PCF analysis. The PCF counts the number of objects 
from a surface at a radial distance (r) and compares this number to the 
expected number of events for a random Poisson-distributed population at 
this distance. 
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Neonatal Human Medulla 


b CD45 EPCAM* Human mTECs 


HLA-DR-APC-Fire750 
Extended Data Fig. 4 | Human thymus contains medullary DCLK1>"8** =~ human thymus (22.3 gestational weeks) gated on CD45~-EPCAM* 


cells. a, Representative immunohistochemistry of neonatal (21-day-old) TECs showing intracellular DCLK1 or isotype control. Note that human 
human thymus stained for DCLK1 or isotype control. Arrows point to DCLK1* events are approximately 3.5% of prenatal TECs. Data are 
individual DCLK1>"** cells or clusters. HC, Hassall’s corpuscle. Scale representative of two independent experiments. 


bars, 25 jum. b, Flow cytometry plot from enzymatically digested prenatal 
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Extended Data Fig. 5 | Characterization of thymic tuft cells. 

a, DCLK1 and RFP (IL-25) in the thymus of C57BL/6 control and Flare25 
mice. Confocal maximal projection. Scale bar, 5 um. n =3 mice, two 
independent experiments. b, RFP (IL-25) in (CD11c7- CD45" EPCAM*) 
mTECs from C57BL/6 control and Flare25 mice. n=5 mice; three 
independent experiments. c, Gating strategy for FACS sorting of 

CD11c- CD45" EpCAMLy51~ mTECs from Flare25-reporter mice for 
qPCR analysis of mTEC8" (REP- MHC-II"8"), mTEC!’ (RFP+MHC- 
11°), and thymic tuft (RFP*) populations. d, qPCR analysis of expression 
of indicated genes of interest on populations sorted in c normalized to 
mTEC"8", Data are mean + s.d. n = 3 mice. Two-way non-parametric 
ANOVA with multiple comparisons correction. e, Representative confocal 
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maximum projection (10 zm) stained for KRT8/KRT18 (red) and DCLK1 
(green). Scale bar, 50 jum. n= 3 mice, two independent experiments. 

f-i, Expression levels (normalized reads from Fig. 2b) from bulk RNA 
sequencing of small-intestinal (SI; n = 3 mice) and thymic tuft (n =4 
mice) cells. f, Expression of major MHC-I genes and B2m. FDR> 0.1. 

g, Expression of major MHC-II genes and Cd74 in small-intestinal 

tuft cells. h, Expression of minor MHC-II genes in thymic tuft cells. 

i, Expression of Tas2r family members in thymic tuft cells. g-i, Data 

are mean +s.d. g, i, The red line corresponds to a cut-off of 5 reads per 
million; data for which the mean +s.d. fall above this cut-off are indicated 
with an asterisk. 
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Extended Data Fig. 6 | Tas2r expression in single thymic tuft cells is not 
stochastic. a, Correlation plot of Tas2r genes in single Flare25 tuft cells as 
analysed in Fig. 2f. Colour and circle size represent pairwise correlation 
value. b, Empirical cumulative distribution function (ECDF) plot of 


Gene 1 Gene 2 Empirical P value 
Tas2r104 Tas2r116 0.0470 
Tas2r115 Tas2r116 0.0448 
Tas2r137 Tas2r118 0.0309 
Tas2r102 Tas2r118 0.0356 
Tas2r138 Tas2r118 0.0283 
Tas2r107 Tas2r119 0.0231 
Tas2r104 Tas2r119 0.0308 
Tas2r104 Tas2r105 0.0447 
Tas2r102 Tas2r107 0.0484 
Tas2r104 Tas2r107 0.0264 
Tas2r104 Tas2r102 0.0457 
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pairwise gene expression correlation of single Flare25 tuft cells. Red points 
represent Tas2r genes that were significantly highly correlated compared 
to the background gene set (P< 0.05). ¢, List of correlated Tas2r gene pairs 
from b and their corresponding empirical P values. 
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Extended Data Fig. 7 | Analysis of Tas2r expression in single Flare25 
and Flare25;Aire~/— thymic tuft cells. a, Intracellular DCLK1 in 
C57BL/6 control and Aire~/~ mTECs (CD11c~ CD45~ EPCAM*). n=5 
mice; three independent experiments. b, Histogram showing the number 
of Tas2r family members expressed in each individual thymic tuft cell. 
Note that the distribution is similar between wild-type and Aire~/~ cells. 
c, Correlation plot of Tas2r genes in single Flare25;Aire ~~ tuft cells as 
analysed in Fig. 3c. Colour and circle size represent pairwise correlation 
values. d, Empirical cumulative distribution function plot of pairwise 
gene expression correlation of single Flare25;Aire~’~ tuft cells. Red points 


represent Tas2r genes that were significantly highly correlated compared 
to the background gene set (P < 0.05). e, List of correlated Tas2r gene 
pairs from d and their corresponding empirical P values. f, Representative 
confocal maximum projections (25 1m) of immunofluorescence analysis 
of C57BL/6 control and Aire~/~ thymi stained for KRT5 (red) and KRT10 
(green). Scale bars, 100 jum. n =3 mice; two independent experiments. 

f, Analysis of DCLK1* cells in Hipk2" controls or Foxn1°Hipk2/! 
(mTEC Hipk2~/~) thymus gated on CD45~- CDR1- EPCAM* mTECs as 
quantified in Fig. 3h. n =6 mice; three independent experiments. 
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Total Features 
Extended Data Fig. 8 | Single-cell RNA-seq data quality and inclusion 
criteria. a, Histograms of total read counts (left) or features (right) 
for each single cell. Red represents Flare25 thymic tuft cells and blue 
represents Flare25;Aire/— thymic tuft cells. Cells with fewer than 100,000 
reads or 750 detected features were discarded from further analysis. 


b, Scatter plot of total features versus mitochondrial read percentage. Red 
represents Flare25 thymic tuft cells and blue represents Flare25;Aire~/~ 
thymic tuft cells. Triangles and circles denote cells from two separate sorts. 
Cells with more than 10% of their reads mapping to mitochondrial genes 
were discarded from further analysis as indicated by the black line. 
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Extended Data Fig. 9 | Diphtheria toxin ablation of thymic tuft cells 

in Kn2 BALB/cBy] x Aire?! C57BL/6 F, mice disrupts the medullary 
NKT2-IL-4 axis. a-d, Analysis of thymus from Kn2 BALB/cBy] x Aire??® 
C57BL/6 F; mice or Aire?’ transgene-negative F, controls treated 

for 9 days with diphtheria toxin. a, Immunofluorescence staining of 

thin thymic sections for KRT10 (red) and DCLK1 (green). n =3 mice; 
two independent experiments. b, Counts of CD11c” CD45" EPCAMt 
mTECs and DCLK1* tuft cells in non-transgenic control thymus (n= 9 
mice) or F; thymus (m= 12 mice). c, Flow cytometry plots gated on 
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TCRB™CD1d* iNKT cells showing intracellular PLZF and ROR‘t staining 
for iNKT subset analysis. Right, counts of NKT1 (PLZF- ROR yt”), NKT2 
(PLZF*ROR yt”) and NKT17 (PLZF- ROR‘t*) in non-transgenic control 
thymus (n= 35 mice) or F; thymus (n= 23 mice). d, Flow cytometry plots 
gated on TCR3*CD8* single-positive thymocytes showing intracellular 
EOMES staining. Right, counts of EOMES* cells in non-transgenic 
control thymus (n= 20 mice) or F, thymus (n= 18 mice). b-d, Data are 
mean +s.d. Unpaired, parametric, two-tailed Student’s t-test. Pooled from 
three independent experiments. 
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Extended Data Fig. 10 | Characterization of Pou2f3~/— mice. 

a, Representative on-edge, full-face mid-thymic sections (5 1m) stained 
with haematoxylin and eosin showing grossly normal thymic architecture 
in Pou2f3 ~~ mice. n= 3 mice; two independent experiments. b, Flow 
cytometry plots (left) and frequencies (right) of mTEC subsets from 
C57BL/6 control and Pou2f3~’~ mice. n=5 mice per genotype; three 
independent experiments. c, Flow cytometry plots (left) and counts (right) 
of thymocyte subsets from C57BL/6 control and Pou2f3~/~ mice. n=5 
mice per genotype; three independent experiments. d, Flow cytometry 
plots (left) of TCR8'"CD1d* thymic iNKT cells in C57BL/6 control 

(n= 26 mice), Pou2f3 ~~ (n= 19 mice) or Trpms/— (n= 10 mice) mice. 
e, Gating strategy of iNKT subset analysis (left) and counts (right) of 


NKT1 (PLZF-ROR»t~) and NKT17 (PLZE-RORt*) in C57BL/6 control 
(n=15 mice), Pou2f3~/— (n= 19 mice) or Trpms~/— (n= 10 mice) mice. 
f, Flow cytometry plots gated on TCR3*CD8>* single-positive thymocytes 
showing intracellular EOMES staining. Quantified in Fig. 4d. g, Flow 
cytometry plots gated on splenic TCR8™CD1d* iNKT cells showing 
intracellular PLZF and ROR‘ staining for iNKT subset analysis, NKT1 
(PLZE-ROR4t~), NKT2 (PLZE*ROR4t~) and NKT17 (PLZE~ROR}t*). 
Right, counts of NKT1 and NKT17 in C57BL/6 control (n=5 mice) or 
Pou2f3 ~/~ (n=4 mice) mice. Two independent experiments. b-e, g, Data 
are mean + s.d. b, ¢, g, Unpaired, parametric, two-tailed Student's t-test. 
d, e, One-way, non-parametic ANOVA (Kruskal-Wallis test). Pooled from 
three independent experiments. 
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Histidine catabolism is a major determinant of 


methotrexate sensitivity 


Naama Kanarek! 4, Heather R. Keys!, Jason R. Cantor!**4, Caroline A. Lewis!, Sze Ham Chan!, Tenzin Kunchok!, 
Monther Abu-Remaileh!2**, Elizaveta Freinkman!, Lawrence D. Schweitzer* & David M. Sabatini!23:4# 


The chemotherapeutic drug methotrexate inhibits the enzyme 
dihydrofolate reductase!, which generates tetrahydrofolate, 
an essential cofactor in nucleotide synthesis”. Depletion of 
tetrahydrofolate causes cell death by suppressing DNA and RNA 
production*. Although methotrexate is widely used as an anticancer 
agent and is the subject of over a thousand ongoing clinical trials‘, 
its high toxicity often leads to the premature termination of its 
use, which reduces its potential efficacy*. To identify genes that 
modulate the response of cancer cells to methotrexate, we performed 
a CRISPR-Cas9-based screen®”. This screen yielded FTCD, which 
encodes an enzyme—formimidoyltransferase cyclodeaminase— 
that is required for the catabolism of the amino acid histidine’, 
a process that has not previously been linked to methotrexate 
sensitivity. In cultured cancer cells, depletion of several genes in 
the histidine degradation pathway markedly decreased sensitivity 
to methotrexate. Mechanistically, histidine catabolism drains the 
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cellular pool of tetrahydrofolate, which is particularly detrimental to 
methotrexate-treated cells. Moreover, expression of the rate-limiting 
enzyme in histidine catabolism is associated with methotrexate 
sensitivity in cancer cell lines and with survival rate in patients. In 
vivo dietary supplementation of histidine increased flux through 
the histidine degradation pathway and enhanced the sensitivity of 
leukaemia xenografts to methotrexate. The histidine degradation 
pathway markedly influences the sensitivity of cancer cells to 
methotrexate and may be exploited to improve methotrexate efficacy 
through a simple dietary intervention. 

To identify genes that contribute to the response of cancer cells 
to methotrexate, we performed a genome-wide, positive-selection 
CRISPR-Cas9-based screen®” in the erythroleukaemia cell line HEL. 
We selected this cell line because of its high sensitivity to methotrexate 
in a competitive growth assay of 42 haematopoietic cell lines in 
the presence of methotrexate (Fig. la, Extended Data Fig. la-c). 
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Fig. 1 | Loss of FTCD decreases the sensitivity of cancer cells to 
methotrexate. a, Selection of the HEL cell line for the CRISPR-Cas9- 
based screen. The genomes of 42 haematopoietic cancer cell lines were 
individually barcoded. The cell lines were pooled together and treated 
with 0, 0.1, 0.5 and 51M methotrexate for 6 days. Genomic barcodes 
were sequenced to determine the relative representation of each line 

in the mixed culture at the various methotrexate concentrations. The 
erythroleukaemia HEL cell line was identified as a sensitive cell line 
suitable for a genome-wide, positive-selection CRISPR-Cas9-based screen. 
b, The two top hits in the CRISPR-Cas9-based screen’ were SLC19A1 
and FTCD. Genes were ranked by the difference between their CRISPR 
score’? in the methotrexate-treated and the vehicle-treated samples. 
c-e, Targeting FTCD by CRISPR-Cas9 in HEL cells decreased their 
sensitivity to methotrexate. c, Fold change in the ECop of methotrexate 
in HEL cells that were treated with methotrexate for 5 days and stably 
expressing the indicated constructs. ECoo values for methotrexate are 
relative to wild-type (WT) cells (n =3, except for SLC19A1 for which 


n=2, biological replicates). d, HEL cells stably expressing the indicated 
constructs were counted daily to assess their survival after treatment 

with 51M methotrexate (n = 3, biological replicates). e, Differential 
interference contrast micrographs of HEL cells stably expressing the 
indicated constructs and treated with 5\1M methotrexate for 3 days. Scale 
bar, 100 1m. A representative experiment is presented (n = 3, biological 
replicates). f, CRISPR-Cas9-depletion of FTCD decreased the sensitivity 
of additional cell lines (Ramos and LAMA84) to methotrexate. Shown 

are fold changes in the ECoo values of methotrexate and the control drug, 
doxorubicin, compared to wild-type cells (n = 3, biological replicates, 
ordinary one-way ANOVA, comparing sgFTCD to each of the other 
samples. For doxorubicin all P values were not significant). In ¢, d, f, error 
bars indicate s.e.m. Abbreviations: mFTCD, cells stably expressing mouse 
Ftcd cDNA that is not targeted by the sgRNA used in this experiment; 
sgAAVS, cells stably expressing an sgRNA targeting the non-coding 
AAVS locus®’; sgFTCD, cells stably expressing an sgRNA targeting FTCD; 
sgSLC19A1, cells stably expressing an sgRNA targeting SLC19A1. 
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Fig. 2 | FTCD depletion enables cancer cells to maintain THF pools 
and nucleotide synthesis even when treated with methotrexate. a, The 
histidine degradation pathway. FTCD has two functions in the histidine 
degradation pathway. The formimidoyltransferase domain metabolizes 
THE and the histidine breakdown product FIGLU to produce glutamate 
and 5-formimino THE, and the cyclodeaminase domain metabolizes 
5-formimino THF to 5,10-methenyl THF". b, CRISPR-Cas9-induced 
FTCD depletion increased levels of histidine (top), decreased levels of 
5-formyl THF (bottom) in HEL and Ramos cells, and decreased levels 
of 5,10-methenyl THF in HEL cells and slightly decreased levels of 
5,10-methenyl THF in Ramos cells (middle). c, The use of THF by the 
purine synthesis pathway (left cycle), the TMP synthesis pathway (right 
external cycle), the methyl cycle (right internal cycle), and by FTCD. 
5,10-Methenyl THF is metabolized to 5-formyl THF by the enzyme serine 
hydroxymethyltransferase** **. d, The greater cellular pool of THF and 
significantly higher abundance of nucleotides in methotrexate-treated 
cells after FTCD depletion. THF (top), IMP (middle) and TTP (bottom) 


We focused our efforts on haematopoietic cell lines because methotrexate 
is most frequently used to treat haematopoietic malignancies”. The two 
highest-scoring'” genes in the screen were SLC19A1 and FTCD (Fig. 1b, 
Extended Data Fig. 1d, e). SLC19A1 is a reduced folate transporter that 
imports methotrexate into cells, and its depletion reduces methotrexate 
sensitivity in cultured cells'! and in patient tumours'??. 

FTCD is an enzyme involved in histidine catabolism and has not pre- 
viously been associated with methotrexate sensitivity. Depletion of 
FTCD using two distinct single-guide RNAs (sgRNAs) increased the 
ECo9 (90% of maximal effective concentration) of methotrexate by 
more than tenfold relative to controls (Fig. 1c—e). Importantly, expres- 
sion of the murine Ftcd cDNA, which is resistant to sgRNA-mediated 
targeting, re-sensitized FTCD-depleted cells to methotrexate, validat- 
ing the results of our screen (Fig. 1c-f). RNA interference (RNAi)- 
mediated knockdown of FTCD also increased the ECgy of methotrexate 
(Extended Data Fig. 1f, g), which suggests that even subtle reductions 
in FTCD expression are sufficient to alter methotrexate sensitivity. 
CRISPR-Cas9-mediated depletion of FTCD also decreased the sensi- 
tivity of Ramos (Burkitt’s lymphoma) and LAMA84 (chronic myeloid 
leukaemia) cells to methotrexate (Fig. 1f, Extended Data Fig. 1h), which 
demonstrates that our findings are generalizable to cell lines derived 
from additional haematopoietic malignancies. 

FTCD catalyses two reactions in the histidine degradation path- 
way'“ (Fig. 2a, Extended Data Fig. 2a). The formimidoyltransferase 
domain metabolizes tetrahydrofolate (THF) and the histidine- 
breakdown product formiminoglutamic acid (FIGLU) to produce glu- 
tamate and 5-formimino THE. The cyclodeaminase domain further 


levels were measured by LC-MS in vehicle- and methotrexate-treated 
HEL and Ramos cells. e, Newly synthesized IMP (top) and TTP (bottom) 
in FTCD-depleted, methotrexate-treated cells shown by [U-'°C]serine 
labelling. IMP and TTP fractional labelling is shown for vehicle-treated 
and methotrexate-treated HEL and Ramos cells. Right, the four carbons 
that are contributed to IMP or the one carbon contributed to TTP by 
[U-3C]serine. For b, d, e, 1 = 3, biological replicates. Statistical tests 
used were a one-way ANOVA (b, d) and a two-way ANOVA for the 
unlabelled fractions (e). ANOVA was done as multiple comparisons 
between sgFTCD and each of the other samples. Error bars indicate s.e.m. 
Abbreviations: AICART, aminoimidazolecarboxamide ribonucleotide 
formyltransferase (AICAR formyltransferase); DHFR, dihydrofolate 
reductase; GART, phosphoribosylglycinamide formyltransferase (GAR 
transformylase); IMP, inosine monophosphate; MS, methionine synthase; 
MTHER, methylene tetrahydrofolate reductase; NS, not significant; 
SHMT, serine hydroxymethyltransferase; TTP, thymidine triphosphate; 
TYMS, thymidylate synthase. 


metabolizes 5-formimino THF to 5,10-methenyl THE. We profiled 
FTCD-relevant metabolites in FTCD-depleted HEL and Ramos cells 
using liquid chromatography coupled with mass spectrometry (LC- 
MS). Depletion of FTCD increased the levels of histidine (Fig. 2b, top, 
Extended Data Fig. 2b), and decreased the levels of 5,10-methenyl THF 
(Fig. 2b, middle, Extended Data Fig. 2b) and the downstream metabo- 
lite 5-formyl THF (Fig. 2b, bottom, Extended Data Fig. 2b). The modest 
decline in 5,10-methenyl THF is probably due to an additional pool 
of this metabolite, synthesized by the enzyme methylenetetrahydro- 
folate dehydrogenase 1 (MTHFD1). This pool of 5,10-methenyl THF 
is directly channelled to 10-formyl THF by MTHFD1 and is unlikely 
to serve as a substrate for the build-up of 5-formyl THF’. These data 
indicate that depletion of FTCD decreases flux through the histidine 
degradation pathway. 

Methotrexate treatment causes a decrease in the cellular pool of THE, 
which inhibits nucleotide synthesis and is considered to be the main 
cause of cell death by the treatment*. The use of THF by FTCD results 
in the synthesis of 5-formyl THE, a folate entity that has no described 
role as an enzyme cofactor and is considered to be a storage form of 
folate’® (Fig. 2c, Extended Data Fig. 2d). We proposed that the con- 
sumption of THF by FTCD for the synthesis of 5-formyl THF would be 
particularly harmful to cells in which the THF pool is already limiting, 
such as in methotrexate-treated cells. Indeed, depletion of the FTCD 
gene resulted in significantly higher levels of THF in methotrexate- 
treated cells (Fig. 2d, top, Extended Data Figs. 2c, 3c, d), indicating that 
the use of THF by FTCD contributes to the exhaustion of THF pools 
observed in wild-type cells treated with methotrexate. 
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Fig. 3 | The histidine degradation pathway affects the sensitivity of 
cancer cells to methotrexate and HAL expression is associated with 
treatment response in patients with ALL. a, CRISPR-Cas9-induced 
HAL depletion decreased the sensitivity of HEL and Ramos cells to 
methotrexate. The average fold change in the ECy9 of methotrexate 
relative to the wild-type is shown (n= 2, biological replicates, one-way 
ANOVA, as multiple comparisons between sgHAL and each of the other 
samples). b, The greater cellular pool of THF in methotrexate-treated 
cells after HAL depletion. Top, THF levels were measured by LC-MS 

in vehicle- and methotrexate-treated HEL and Ramos cells. Newly 
synthesized AMP (middle) and TTP (bottom) in HAL-depleted and 
control cells, treated with methotrexate, shown by [U-!3C]serine labelling 
(n = 3, biological replicates; for THE, one-way ANOVA; for AMP and 
TTP, two-way ANOVA for the unlabelled fraction). c, HAL expression 

is significantly higher in cells that are more sensitive to methotrexate. 
The response to methotrexate was determined in a pooled fashion using 
genomically barcoded cell lines (Fig. 1a). HAL expression was measured 


We observed significantly higher levels of inosine monophosphate 
(IMP) and thymidine triphosphate (TTP) after methotrexate treatment 
in FTCD-depleted cells compared to control cells (Fig. 2d middle and 
bottom), which suggests that the increased THF pool size in these cells 
contributes to nucleotide biosynthesis. To examine the ability of metho- 
trexate-treated, FTCD-depleted cells to synthesize nucleotides, we used 
uniformly labelled (U-) ['?C]serine to label newly synthesized nucleo- 
tides in cells treated with methotrexate. Loss of FTCD significantly 
increased the labelling of IMP and TTP in methotrexate-treated cells 
(Fig. 2e), without changing the overall labelling efficiency (Extended 
Data Fig. 2e, f) or the cellular pools of glycine and serine (Extended 
Data Fig. 3a, b). These results suggest that FTCD depletion enables the 
survival of methotrexate-treated cells by sparing cellular THF pools, 
and thus enabling de novo nucleotide synthesis to continue. 

To determine the contribution of other components of the histidine 
degradation pathway (Fig. 2a) to methotrexate sensitivity, we genetically 
targeted histidine ammonia lyase (HAL) and amidohydrolase domain 
containing 1 (AMDHD1), two enzymes of the histidine degradation 
pathway that function upstream of FTCD!”~!’. Similar to our find- 
ings with FTCD, CRISPR-Cas9-mediated depletion of these enzymes 
decreased the sensitivity of cells to methotrexate (Fig. 3a, Extended 
Data Fig. 4a). As in FTCD-depleted cells, methotrexate-treated, HAL- 
depleted cells had significantly higher levels of THF (Fig. 3b, top), and 
higher rates of nucleotide synthesis as monitored by [U-'%C]serine 
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by qPCR (n= 4 for resistant cell lines and n = 6 for sensitive cell lines, 
biologically independent samples, two-tailed Kolmogorov—Smirnov test). 
d, e, Differences in HAL expression are associated with methotrexate 
sensitivity in cancer cell lines. d, Cell lines were ranked by HAL expression 
according to RNA-seq data”!. Green, cell lines with high HAL expression. 
Blue, cell lines with low HAL expression. e, HAL expression (by qPCR) 
and methotrexate ECgg are shown for each of the coloured cell lines from 
d (low HAL (n=3), high HAL (n=7), two-tailed Kolmogorov-Smirnov 
test, individual ECgo values: n = 2, biological replicates). f, CRISPR-Cas9- 
depletion of HAL in EOL-1 and NCIH1666 cells decreased their sensitivity 
to methotrexate. Fold changes in methotrexate ECoy compared to wild 
type are shown (n = 2, biological replicates, one-way ANOVA). g, HAL 
expression predicts better survival in paediatric patients with ALL treated 
with a regimen that included methotrexate. Kaplan-Meier curves of 
overall survival of patients with ALL with high (top quantile, green) or low 
(bottom quantile, blue) expression of HAL”4 (n = 21). Error bars indicate 
s.e.m. 


tracing in comparison to control cells (Fig. 3b, middle and bottom). 
Notably, whereas control cell lines show increased total serine and 
[U-'3C]serine levels upon methotrexate treatment, HAL-depleted 
HEL cells show total and [U-!’C]serine levels that are similar to vehi- 
cle-treated cells (Extended Data Fig. 4d, f). The increased serine levels 
in methotrexate-treated control cells are probably the result of reduced 
serine usage owing to limiting cellular levels of THE. Glycine label- 
ling appears to be the same in all samples (Extended Data Fig. 4c, e). 
These data suggest that, as with the depletion of FTCD, the depletion 
of HAL results in a higher availability of reduced folate and thus greater 
nucleotide synthesis rates in cells treated with methotrexate. 

We found that the expression of HAL—which encodes the rate- 
limiting enzyme in the histidine degradation pathway*’—but not of 
FTCD or AMDHDI was significantly higher (Fig. 3c, Extended Data 
Fig. 4b) in haematopoietic cell lines that we identified as being highly 
sensitive to methotrexate (Fig. 1a, Extended Data Fig. la-c). To follow 
up on this finding, we analysed RNA-seq data from 1,010 cell lines”! to 
identify those with the highest and lowest expression of HAL (Fig. 3d), 
and found that cell lines with high HAL expression were more sensitive 
to methotrexate than those with low expression (Fig. 3e). Depletion of 
HAL in two high HAL-expressing lines (EOL-1 and NCIH1666) sub- 
stantially decreased their methotrexate sensitivity (Fig. 3f, Extended 
Data Fig. 5a), revealing a functional role for HAL in the response 
of these cells to the drug. Thus, cells with endogenously low or 
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Fig. 4 | In vivo histidine supplementation increases flux through the 
histidine degradation pathway and sensitizes tumours to methotrexate. 
a, Workflow for the in vivo assessment of the methotrexate-sensitivity of 
tumour xenografts after histidine supplementation. HEL or SEM cells were 
injected subcutaneously into NOD-SCID mice. Three weeks later tumour 
size was assessed by in vivo imaging and mice were randomly divided into 
four experimental groups: vehicle-treated, histidine supplementation, 
methotrexate-treated and histidine supplementation combined with 
methotrexate treatment. The treatments were followed by a second 
imaging session. b, c, The combination of methotrexate and histidine 
supplementation is the only treatment that resulted in a significant 
reduction in tumour size. Fold changes in tumour size as measured 

in vivo before and after the different treatments are presented for HEL- 
cell-derived tumours (b), and SEM-cell-derived tumours (c) (non- 
parametric one-way ANOVA, Kruskal-Wallis test, Dunn's post hoc 

test). d, The combination of methotrexate treatment and histidine 
supplementation caused cancer cell death. Haematoxylin and eosin 
staining of tumours from mice injected with HEL cells and treated with 
the different regimes is presented at two magnifications. Mitotic cells 


exogenously lowered HAL expression are both less sensitive to meth- 
otrexate than control cells. 

Methotrexate is used as the standard of care in the treatment of the 
most common paediatric malignancy, acute lymphoblastic leukaemia 
(ALL)”, with relatively high success rates that unfortunately are accom- 
panied by severe toxicity’. At present, treatment-outcome prediction 
markers are lacking”’, leading physicians to prescribe methotrexate 
regimens based on clinical parameters instead of personalized pharma- 
cogenetic factors. Notably, we found that young patients with ALL that 
showed high HAL expression in the ALL cells have significantly higher 
survival rates, compared to patients in the same study” with low HAL 
expression (Fig. 3g). Expression levels of SLC19A 1, which encodes the 
transporter for methotrexate, as well as the genes (AMDHD1 and 
FTCD) encoding histidine degradation pathway enzymes showed no 
significant association with patient survival (Extended Data Fig. 5b-d). 
These data warrant further investigation of the expression level of HAL 
as a prediction factor for treatment success rate in patients with ALL 
that are treated with methotrexate. 

The association between expression levels of the rate-limiting enzyme 
in the histidine degradation pathway and methotrexate sensitivity in 
cancer cell lines and patients raised the possibility that increasing flux 
through this pathway could enhance the cytotoxic effects of methotrex- 
ate. We proposed that in vivo histidine supplementation would increase 
histidine degradation and consumption of THE by FTCD, which might 
enhance the toxic effects of methotrexate on tumour cells. To assess the 
response of cancer cells to methotrexate in the presence of above-normal 


(blue arrows) and apoptotic or necrotic cells (orange arrows) are marked 
in the bottom panel. Scale bars: 200 {1m (top), 10 1m (bottom). e, f, The 
combination of methotrexate and histidine supplementation was the only 
treatment that resulted in significant necrosis in tumours. Necrotic areas 
as detected by haematoxylin and eosin staining (d, top three rows, right 
column) were measured in the different groups (non-parametric one- 

way ANOVA, Kruskal-Wallis test, Dunn’s post hoc test). g, THF levels 
decreased after methotrexate treatment and decreased even further when 
methotrexate was combined with histidine supplementation. Methotrexate 
levels were not different in tumours from mice treated with methotrexate 
alone or in combination with histidine supplementation. Nucleotide 
abundance was significantly lower in tumours from mice treated with 
methotrexate and histidine supplementation compared to tumours from 
vehicle-treated mice (non-parametric one-way ANOVA, Kruskal-Wallis 
test, Dunn’s post hoc test. Methotrexate, two tailed t-test). b, d, e, vehicle 
(n=5), histidine (n = 4), methotrexate (n = 6), methotrexate + histidine 
(n=6).¢, f, (n=6), (n=7), (n=7), (n=7) for the same groups. g, (n=5), 
(n=4), (n=5), (n=6), for the same groups. Centre bars indicate mean, 
error bars indicate s.e.m. 


levels of histidine, we generated tumours by injecting HEL or SEM cells 
subcutaneously into NOD-SCID mice and treated the tumour-bearing 
mice with vehicle, histidine, methotrexate or a combination of histidine 
and methotrexate (Fig. 4a). To reliably assess the contribution of histi- 
dine supplementation, we intentionally used a mild regimen of meth- 
otrexate consisting of doses lower than those normally used in mice”®. 

As expected, tumours from mice treated with this reduced dosing 
of methotrexate did not have a significant decrease in size (Fig. 4b, c, 
Extended Data Fig. 6a, b). However, tumours from mice given both 
methotrexate and histidine showed a marked decrease in tumour size 
that was significantly greater than in any of the other treatment cohorts 
(Fig. 4b, c). Additionally, tumours from mice treated with methotrex- 
ate and histidine supplementation appeared histologically worse and 
contained large necrotic areas and what morphologically appeared to 
be necrotic or apoptotic cells (Fig. 4d—-f, Extended Data Figs. 6c, d, 7a). 
Organs known to be adversely affected by methotrexate showed no 
further toxicity in mice treated with the combined therapy compared 
to mice treated with methotrexate alone (Extended Data Figs. 7b, 9a), 
and no difference was observed in the weight loss of the mice treated 
with methotrexate alone compared to methotrexate treatment supple- 
mented with histidine (Extended Data Fig. 9b). No enhanced toxicity 
was observed in mice treated with the combined therapy compared 
to methotrexate alone even when the mice were treated for 15 days 
(Extended Data Figs. 10, 11). 

According to our hypothesis, histidine supplementation should 
increase the flux through the histidine degradation pathway and cause 
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increased consumption of THF, a limiting metabolite in methotrex- 
ate-treated tumours. Notably, THF levels were significantly lower in 
tumours from mice treated with the combined therapy compared to 
methotrexate alone, which suggests lower THF availability in these 
tumours, although methotrexate levels were not different (Fig. 4g, 
Extended Data Fig. 8). Both methotrexate and folate levels were very 
similar in the plasma of mice treated with methotrexate alone or in 
combination with histidine supplementation (Extended Data Fig. 9e, f). 
Metabolites of the histidine degradation pathway were increased in 
HEL-cell-derived tumours in mice treated with histidine supplemen- 
tation (Extended Data Fig. 9c, d). 

Lower availability of THF should compromise the capacity of cells 
to synthesize nucleotides, which are essential for cell proliferation and 
viability. Indeed, the levels of AMP, GMP and TMP in tumours from 
mice treated with both methotrexate and histidine were significantly 
lower than those found in tumours from vehicle-treated mice (Fig. 4g, 
Extended Data Fig. 8). Collectively, our data indicate that histidine 
supplementation can increase the efficacy of methotrexate in mice in 
vivo by enhancing the depletion of THF within tumour cells. 

Taken together, our results provide evidence for a role of the histidine 
degradation pathway in the cellular response to the widely used chemo- 
therapy agent methotrexate. Endogenous expression levels of HAL, 
which encodes the rate-limiting enzyme of the pathway, are associated 
with the sensitivity of cancer cell lines to methotrexate, and the overall 
survival of patients with ALL who are treated with methotrexate. These 
findings suggest that HAL expression might serve as a clinical predictor 
for better responders to methotrexate treatment among patients with 
ALL, and may be informative for decisions regarding therapy strategies. 
Furthermore, both genetic perturbation and dietary enhancement of 
the pathway changed the sensitivity of haematopoietic cancer cells to the 
chemotherapy. As standard protocols for administering methotrexate 
are often accompanied by severe toxicity”, we suggest that dietary 
supplementation with histidine could represent a relatively low-risk 
intervention that might enable reduced dosing of this toxic agent and 
therefore a greater clinical benefit. 
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METHODS 


Barcoding and co-culture of 42 haematopoietic cell lines. Cell lines used in this 
manuscript were authenticated by short tandem repeat analysis and were tested and 
found negative for mycoplasma. The sources of cell lines were as follows. SKM1, 
P12-Ichikawa, MOLT4, PF382, SEMK2: T. Look (Dana-Farber Cancer Institute 
(DFCI)); SUDHL4, HBL, LY7, U2932: M. A. Shipp (DFCI); EOL-1, CEM, REH: 
S. A. Armstrong (DFCI); KU812, NOMO1, K562, HEL: J. Griffin (DFCI); 697, 
NALM6E: D. M. Weinstock (DFCI); SUDHL8, LY8: A. G. Letai (DFCI); NCIH1666, 
D. Lauffenburger (MIT); Mono Mac 6: M. Zaitseva (FDA); WSU-DLCL2: A. M. 
Al-Katib (Wayne State University School of Medicine); CH-157 MN: R. Jensen 
(University of Utah). All other cell lines were in the Sabatini laboratory’s cell bank 
and were collected or purchased by former lab members. 
Construction of unique barcoded plasmids. The oligonucleotides BC-top and 
BC-bottom were annealed and phosphorylated (T4 PNK, NEB) to generate a 
linear DNA fragment containing a 5’-Agel overhang, a 3’-EcoRI overhang, and a 
degenerate NNNNNNN 7-mer (N is A, T, G, C). The annealed product was cloned 
into pLKO to generate a collection of pLKO-BC plasmids containing unique 7-mer 
barcodes that were subsequently determined by Sanger sequencing, as similarly 
described elsewhere”. 

BC-top: 
CCGGTTTTTAGCATCGCCNNNNNNNCTCGCGGCCGCAGGTCCATG 

BC-bottom: 
AATTCATGGACCTGCGGCCGCGAGNNNNNNNGGCGATGCTAAAAA 
Virus production. HEK-293T cells were co-transfected with a pLKO-BC plasmid, 
the VSV-G envelope plasmid and the AVPR lentiviral packaging plasmid using 
XTremeGene 9 Transfection Reagent (Roche). Culture medium was exchanged 
16h after transfection with the same medium instead supplemented with 20% 
inactivated fetal bovine serum (IFS). The virus-containing supernatant was col- 
lected 48h after transfection and passed through a 0.45-|1m filter to eliminate cells. 
Transduction of cell lines. Cells were pelleted, then seeded at a density of 300,000 
cells per ml in 6-well plates in 2ml of RPMI containing 8 1g ml“! polybrene (EMD 
Millipore), and then transduced with lentivirus by centrifugation at 1,200g for 
90 min at 37°C. After a 24-h incubation, cells were pelleted to remove virus and 
then re-seeded into fresh culture medium containing puromycin, and selected 
for 72h. 
Competitive survival of barcoded cell lines. Forty-two haematopoietic cancer cell 
lines that each carried a stably incorporated specific genomic barcode were pooled 
together and treated for 6 days with methotrexate (0, 0.1, 0.5 and 5\1M). Genomic 
DNA (gDNA) samples were collected at days 2, 4 and 6 and barcode abundances 
were determined by high-throughput sequencing (Illumina MiSeq). 
Positive-selection CRISPR-Cas9-based screen. Before the screen, the ECoo of 
methotrexate in HEL cells was determined in culture conditions similar to the 
screen. Detailed protocols for the CRISPR-Cas9-based screen are described else- 
where*”**, In short, the sgRNA library’ was transformed and propagated for virus 
production as described above, followed by HEL transduction at an efficiency of 
60% to avoid more than one sgRNA in each cell. After selection of transduced cells 
by puromycin, cells were divided into two replicates each of DMSO- and metho- 
trexate-treated groups and were cultured for 14 days. Cells (100 x 10°) were col- 
lected for genomic DNA purification and sgRNA library deconvolution by HiSeq 
deep sequencing. Sequencing reads were aligned to the sgRNA library and the 
abundance of each sgRNA was calculated’. The counts from all replicates of the 
initial cell populations were combined to generate the initial reference dataset. The 
counts from each sample were normalized for sequencing depth. sgRNAs with 
fewer than 40 reads in the initial reference dataset were omitted from downstream 
analyses. The log fold change in abundance of each sgRNA between the final and 
initial reference populations was calculated and used to define a CRISPR score for 
each gene’. The CRISPR score is the average log, fold change in abundance of all 
sgRNAs targeting a given gene. Genes targeted by fewer than four distinct sgRNAs 
in the initial reference dataset were omitted from downstream analyses. 
Gene targeting using single sgRNAs. To generate HEL, Ramos, LAMA84, EOL-1 
and NCIH1666 cells stably expressing the indicated sgRNAs, the following oligo- 
nucleotides were cloned into the pLentiCRISPR V1 viral vector. 
Oligonucleotides used for sgRNA cloning. sgA AVS: caccgTCCCCTCCACCCCA- 
CAGTG (sense), aaacCACTGTGGGGTGGAGGGGAc (antisense). sgFTCD_1: 
caccgGCA GGTTGAGCGCGATGCGG (sense), aaacCCGCATCGCGCT- 
CAACCTGCec (antisense). sgFTCD_2: caccgGGAGCCATCACACAGACCCC 
(sense), aaacGGG GITCTGTGTGATGGCTCCe (antisense). sgSLC19A 1: caccegTG- 
CGCGGGTCTA CAACGGCG (sense), aaacCGCCGTTGTAGACCCGCGCAc 
(antisense). sg HAL: caccgGCAGGCAGCTCACTGAGGGA (sense), aaac TCCCT- 
CAGTGA GCTGCCTGCec (antisense). ssAMDHD1: accgATGGAAATTCAC- 
CAGGCCGG (sense), aaacCCGGCCTGGTGAATTTCCATc (antisense). 

To generate lentivirus, 3 x 10° HEK-293T cells were seeded in a 10-cm plate 
in DMEM supplemented with 10% IFS. After 24h, the cells were transfected with 
the above sgRNA pLenti-encoding plasmids alongside the AVPR envelope and 
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CMV VSV-G packaging plasmids using the XTremeGene 9 Transfection Reagent 
(Roche). Twelve hours after transfection, the medium was aspirated and replaced 
with 8 ml of fresh medium. Virus-containing supernatants were collected 48h after 
the transfection and passed through a 0.45-1m filter to eliminate cells. The trans- 
duction of cells was performed as described above. 

Gene knockdowns using shRNAs. Lentiviral shRNAs were obtained from The 
RNAi Consortium’. Lentivirus production and viral transduction of HEL cells 
were performed as described above, followed by a three-day selection with puro- 
mycin. Knockdown was verified 48 h after completion of selection by qPCR. 
shRNA sequences. GFP, T@CCCGACAACCACTACCTGA. RFP, CGCGTGAT 
GAACTTCGAGGAC. LacZ, CCAACGTGACCTATCCCATTA. FTCD 
shRNA 1, CCGGGTAGCTTCCCGACTTAT. FTCD shRNA 2, CGAGAAG 
GAGAACCTCTTCAT. SLC19A1 shRNA 1, CGACGGTGTTCAGAATGTGAA. 
SLC19A1 shRNA 2, GATTGCATCTTCTCTGTCTAA. 

Methotrexate treatment of cell lines in culture. Unless otherwise indicated, 51M 
methotrexate (Sigma) was added to RPMI supplemented with 10% IFS for at least 
3 days. Owing to their relatively low sensitivity to the drug, Ramos cells were 
treated with 20)1M methotrexate. 

Cell survival assays. Cells were seeded at a cell density of 50,000 cells per ml in 
96-well plates. Three hours later, cells were treated with 7-10 concentrations of 
methotrexate or doxorubicin in triplicate. Cells were incubated for 5 days, after 
which an ATP-based, cell viability assay was performed (CellTiter Glo, Promega). 
Survival curves were calculated by best-fit analysis of the log of the drug concen- 
tration (in |1M) to fold change of treated cells over vehicle-treated cells. All survival 
assays included technical triplicates per sample, per experiment. 

Cell counting. Cells were counted using a Coulter Counter (Beckman Z2). The 
percentage viability was calculated by a FACS-based (BD LSR flow cytometer) 
viability assay using 7-AAD staining (Thermo Fisher Scientific). Each experiment 
included technical triplicates per sample. 

Metabolite profiling by mass spectrometry. Sample preparation for polar metabolite 
detection. For cells in culture, two million cells were collected, washed once with 
0.9% NaCl and resuspended in extraction buffer (80% methanol, 20% H2O plus 
isotopically labelled internal standards). Samples were then mixed by vortexing for 
10s and the debris was pelleted by a 10-min spin at 18,000. The supernatant was 
then transferred to a new tube and dried using a liquid nitrogen dryer. 

For tumours, the tumours were collected and immediately snap-frozen in liquid 
nitrogen. Tumours were thawed by homogenization (Kimble cordless handheld 
homogenizer) in extraction buffer, followed by the procedure described for cell 
culture samples. All metabolites measured in tumours were normalized to an average 
of four amino acids (phenylalanine, leucine, valine and tyrosine) as an internal 
loading control. 

For plasma samples, blood was collected from the submandibular vein into 
anticoagulant tubes (Microvette, Sarstedt), and immediately placed on ice. Samples 
were spun for 6 min at 1,000g and supernatants were collected. From each sample, 
1011 was collected and used for further LC-MS analysis by extraction in 90 1l 
extraction buffer and sample preparation as described for cultured cells. 

Sample preparation for folate detection. The protocol for folate detection was similar 
to that previously described*4, and adapted with a minor change: the extraction 
buffer used was 80% methanol, 20% 12.5mM sodium ascorbate. All subsequent 
steps were as described*“. In brief, samples were washed once (0.9% NaCl), then 
extracted in extraction buffer (80% methanol, 20% 12.5mM sodium ascorbate, 
plus aminopterin as internal standard), mixed by vortexing for 10s and debris was 
pelleted by a 10-min spin at 18,000 g. Supernatants were split for polar metabolite 
and folate detection and dried using a liquid nitrogen dryer manifold. Polar sam- 
ples were resuspended in water to be run in the mass spectrometer. Folate samples 
were resuspended in reconstitution buffer (0.5% ascorbic acid, 1% K,HPO, and 
0.5% 2-mercaptoethanol) containing rat serum (Sigma, R9759). The rat serum 
contains the enzymes required to strip the polyglutamate tail from the measured 
folate. Rat endogenous folate was removed from the rat serum by activated carbon 
treatment (Sigma, C9157). Polyglutamate tail removal was carried out for 2h at 
37°C. After the polyglutamate tail removal, the pH was adjusted to 4 using formic 
acid and samples were loaded on Bond Elute-pH columns (Agilent, 14102062) for 
a pH-based clean-up. Columns were eluted with 3001 elution buffer (50% meth- 
anol, 0.25% NH,OAc, 0.5% 2-mercaptoethanol), followed by drying the samples 
using a liquid nitrogen dryer manifold and resuspension in 501 water. 

Polar metabolite profiling by LC-MS. Dried polar samples were resuspended in 
50 ul water and 211 was injected into a ZIC-pHILIC 150 x 2.1 mm (51m particle 
size) column (EMD Millipore). Analysis was conducted on a QExactive benchtop 
Orbitrap mass spectrometer equipped with an Ion Max source and a HESI II probe, 
which was coupled to a Dionex UltiMate 3,000 UPLC system (Thermo Fisher 
Scientific). External mass calibration was performed using the standard calibra- 
tion mixture every 7 days. Chromatographic separation was achieved using the 
following conditions: buffer A was 20mM ammonium carbonate, 0.1% ammonium 
hydroxide; buffer B was acetonitrile. The column oven and autosampler tray were 
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held at 25°C and 4°C, respectively. The chromatographic gradient was run at a 
flow rate of 0.150 ml min“! as follows: 0-20 min: linear gradient from 80% to 20% 
B; 20-20.5 min: linear gradient from 20% to 80% B; 20.5-28 min: hold at 80% B. 
The mass spectrometer was operated in full-scan, polarity switching mode with 
the spray voltage set to 3.0 kV, the heated capillary held at 275°C, and the HESI 
probe held at 350°C. The sheath gas flow was set to 40 units, the auxiliary gas flow 
was set to 15 units, and the sweep gas flow was set to 1 unit. The data acquisition 
was performed over a range of 70-1,000 m/z, with the resolution set at 70,000, 
the automatic gain control target at 10e6, and the maximum injection time at 
20 ms. Relative quantitation of polar metabolites was performed with XCalibur 
QuanBrowser 2.2 (Thermo Fisher Scientific) using a 5 p.p.m. mass tolerance and 
referencing an in-house library of chemical standards. 

Histidine was normalized to isotopically labelled histidine as an internal stand- 

ard; IMP, AMP and TTP levels were normalized to isotopically labelled glutamate 
as an internal standard. 
Detection of folate species by LC-MS. Detection of folate species was performed 
on the same instrumentation described above. In general, instrument settings 
remained the same unless specified. After removal of the polyglutamate tails from 
folate species**, samples were resuspended in 50 ul water and 15 1] was injected 
onto an Ascentis Express C18 HPLC column (2.7 ym x 15cm x 2.1mm; Sigma 
Aldrich). The column oven and autosampler tray were held at 30°C and 4°C, 
respectively. The following conditions were used to achieve chromatographic sep- 
aration: buffer A was 0.1% formic acid; buffer B was acetonitrile with 0.1% formic 
acid. The chromatographic gradient was run at a flow rate of 0.250 ml min“! as 
follows: 0-5 min: gradient was held at 5% B; 5-10 min: linear gradient of 5% to 
36% B; 10.1-14.0 min: linear gradient from 36-95% B; 14.1-18.0 min: gradient 
was returned to 5% B. The mass spectrometer was operated in full-scan, positive 
ionization mode. Data acquisition was performed using three narrow-range scans: 
438-450 m/z; 452-462 m/z; and 470-478 m/z, with the resolution set at 70,000, the 
automatic gain control target at 10e6, and the maximum injection time of 150 ms. 
Relative quantitation of folate species was performed with XCalibur QuanBrowser 
2.2 (Thermo Fisher Scientific) using a 5 p.p.m. mass tolerance. Non-glutamated 
folate species were identified using chemical standards (see below), together 
with comparison of experimental MS2 spectra with in silico predicted spectra 
(Supplementary Fig. 1) Mirror plots were created using R and structures were 
identified based on the known parental structure (HMDB, www.hmdb.ca), using 
a combination of ChemDraw (PerkinElmer), CEM:ID*? (cfmid.wishartlab.com), 
METLIN (metlin.scripps.edu) and Sirius***” (version 3.5.1). 

The standards for different forms of folate are as follows: aminopterin (Sigma, 
A3411); folate (Schircks Laboratories, 16.203); THF (Schircks Laboratories, 
16.207), 5-formyl THF (Schircks Laboratories, 16.221); triglutamate 5-formyl 
THE (Schircks Laboratories, 16.283); 5,10-methenyl THF (Schircks Laboratories, 
16.230); 5-methyl THF (Schircks Laboratories, 16.235). Levels of all folate forms, 
as well as methotrexate were normalized to aminopterin as an internal standard. 
Mass spectrometry reagents. Chloroform was purchased from Sigma-Aldrich and 
was HPLC grade (ethanol-stabilized). Acetonitrile was purchased from Merck 
Millipore and was LC-MS Hypergrade. All other solvents were purchased from 
Fisher and were Optima LC-MS grade. Folate standards were purchased from 
Schircks laboratories. A mix of 17 isotopically labelled amino acids (Cambridge 
Isotope Laboratories) was used as the internal standard. FIGLU standard was pur- 
chased from Dalton Research Molecules (DC-001551). 

Labelling with [U-!C]serine. Cells (500,000) were seeded in RPMI containing 
10% dialysed IFS and cultured for 24h. For FTCD-null cells (Fig. 2e), HEL cells 
were treated with 51M methotrexate for 48h, and Ramos cells were treated with 
201M methotrexate for 72h to decrease THF levels and nucleotide synthesis in 
wild-type cells. The medium was then replaced with [U-'°C]serine-containing 
medium by counting 1 x 10° cells that were washed once with PBS, and reseeded in 
RPMI containing 10% dialysed IFS that contained [U-'°C]serine and no unlabelled 
serine. Cells were incubated in [U-!C]serine plus methotrexate for an additional 
24h followed by cell collection and LC-MS analysis. For HAL-null cells (Fig. 3b), 
Ramos cells were treated with 20|1M methotrexate for 48h before [U-°C]serine 
labelling to decrease THF levels and nucleotide synthesis in wild-type cells. The 
medium was then replaced with [U-'°C]serine-containing medium by counting 
1 x 10° cells that were washed once with PBS, and reseeded in RPMI containing 
10% dialysed IFS that contained [U-4C]serine and no unlabelled serine. Cells were 
incubated in [U-'?C]serine plus methotrexate for an additional 24h followed by cell 
collection and LC-MS analysis. HEL cells were treated with 51M methotrexate for 
24h, in parallel with the [U-'3C]serine labelling. Vehicle-treated cells were treated 
with equivalent concentrations of DMSO for the same times. Natural abundances 
of labelled nucleotides were corrected as previously described**”?. 

Quantitative PCR. Cell pellets were washed once with PBS before RNA purifi- 
cation using Trizol (ambion). The reverse transcriptase reaction was performed 
using 21g RNA from each sample (SuperScript II], Thermo Fisher Scientific). 
Quantitative PCR (qPCR) reactions were performed with SYBR green reagent 


(Roche) and according to the manufacturer's instructions. The QuantStudio 6 Flex 
(Applied Biosystems) qPCR instrument was used for all reactions. The reference 
genes used in each qPCR run were UBC and HPRT1. 

Primers used for qPCR. FTCD: forward primer: T@CGAGAAGGAGAACCTCTT, 
reverse primer: CTCGATGATCCGCTCCTTAG. HAL: forward primer: GGCTG 
AAGGTGGTCTGAACT, reverse primer: AGTCAACAGACGAGGGATGG. 
AMDHDI: forward primer: AGTCTCACACACACGGATGG, reverse primer: 
GCCT CCGAACTGGTAAATCA. SLC19A1: forward primer: GAGAGCTTCA 
TCACCCCCTA, reverse primer: GGCCAGGTAGGAGTACGACA. Mouse 
Ftcd: forward primer: AACCTGCTAAGCACCAAGGA, reverse primer: 
GGACC TTTTTCAGACGTCCA. mouse Hal: forward primer: GGTGGCCTT 
AGAGGACAATG, reverse primer: GCTCCCGGTATTTGCTGTAG. Mouse 
Amdhd1: forward primer: TCCACGAGTTTGCAATGAAG, reverse primer: 
CTCCACCGTGAAGTTGATCC. GFP: forward primer: ACAACGTCTA 
TATCATGGCCGA, Reverse primer: GAAGTTCACCTTGATGCCGTTC. 
UBC: Forward primer: ATTTGGGTCGCGGTTCTT, Reverse primer: 
TGCCTTGACATTCTCGATGGT. HPRT1: Forward primer: TGAC 
ACTGGCAAAACAATGCA, Reverse primer: GGTCCTTTTCACCAGCAAGCT. 
Mouse tumour xenografts. HEL and SEM cells were transduced with R-luciferase 
expressing vector (FUW-RLuc-T2A-PuroR), and selected with puromycin. Cells 
(2 x 10°) were injected subcutaneously into 50 NOD-SCID mice (HEL) or 30 
NOD-SCID mice (SEM). Three weeks after injection, tumour size was imaged 
using the IVIS Spectrum in vivo imaging system (PerkinElmer) after intraperi- 
toneal injection of 501g per mouse of the water-soluble substrate Coelenterazine 
(Nanolight Technologies). A total of 24 mice (HEL) or 27 mice (SEM) with similar 
tumour sizes were divided into four experimental groups: vehicle-treated (saline), 
histidine supplementation (400 ul of histidine solution of 46 mg ml’, histidine 
maximum solubility at 25°C), methotrexate-treated (50 mg kg~' methotrexate, 
based on the weight measured at the day of first imaging), and histidine supple- 
mentation combined with methotrexate treatment. Mice were then injected daily 
for 5 days. All solutions were warmed to 37 °C before injection to reduce stress. 
After five days of treatment, tumour size was monitored, and tissues were collected 
for LC-MS and histology. The Committee for Animal Care at the Massachusetts 
Institute of Technology (MIT) approved all animal procedures carried out in this 
study. Mice were euthanized when tumours reached the size limit of 1 cm or if other 
symptoms indicative of low health status appeared. These included lethargy, drop 
in weight (10% over a day or 20% since the beginning of the experiment), reduced 
appetite, lack of movement, hunched back and hair loss. 

We have complied with all relevant ethical regulations while conducting animal 
experiments. Sample size was determined as the minimal number of mice abso- 
lutely necessary for reliable, significant results (no less than 6 and no more than 8 
per cohort, depending on the availability of age-matched mice in the colony). Mice 
were distributed randomly between the different groups. Injections and imaging 
were all done by N.K. and were not done blindly. Figure analysis of necrotic areas 
in haematoxylin and eosin (H&E) slides was done blindly. For all H&E results, 
different mice within each group showed similar results. Mice assigned to the 
experiments: NOD-SCID mice - NOD.CB17-Prkdcscid/NCrCrl (originally pur- 
chased from Charles River and maintained at the Whitehead mouse facility by 
self-breeding). HEL-cell-derived tumours experiment: 24 females, ages 8-11.5 
weeks, mean 10.7 weeks. SEM-cell-derived tumours experiment: 26 females, ages 
12-18 weeks, mean 14 weeks. HEL-cell-derived tumours long-term experiment: 
mice 1-22 females, mice 23-31 males, ages 6-10 weeks, mean 10 weeks. 
Plasmid construction. Cloning into pLentiCRISPR plasmids is described above. 
Mouse cDNA constructs (mouse Ftcd, mouse Hal and mouse Amdhd1) were 
cloned into the backbone plasmid pWPI (a contribution from the Didier Trono 
laboratory, Addgene plasmid number 12254) that was cut at the Pmel site, and 
PCR-amplified mouse cDNA was inserted by ligation. The primers used for each 
gene incorporated a 5’ kozak sequence and Pmel sites flanking the cloned gene on 
the 5’ and 3’ sides and are listed below. The luciferase-expression plasmid used for 
in vivo imaging is FUW-RLuc-T2A-PuroR, which is an FUW lentiviral backbone 
with the R-luciferase gene inserted downstream to a constitutive UBC promoter. 
The R-luciferase gene was cloned using EcoRI. The R-luciferase ORF is 936 base- 
pairs long, missing its stop codon and is in frame with the T2A-PuroR. 

Primers used for the cloning. Mouse Ftcd: forward primer: GTACGTTTAAACCT 
CGAGGCCACCATGTCTCAGTTGGTGGAATG, reverse primer: TTCAG 
TTTAAACTCACTCCTTTCGGGCT. Mouse Hal: forward primer: GTACGTT 
TAAAC CTCGAGGCCACCATGCCTAGGTACACAGTGCA, reverse primer: 
TTCAGTTTAAAC TTAAAGATCGTCAGACTCTGGG. Mouse Amdhd1: for- 
ward primer: GIACGTTTAAAC CTCGAGGCCACCATGACGAGCAGCCACC, 
reverse primer: TTCAGTTTAAAC TCACTTCTTGTAGATGACTTTTCC. Pme 
site: GITTAAAC, kozak sequence: CTCGAGGCCACC. 

Plasmids used in this study. pLentiCRISPR sgAAVS (Addgene 83906), pLen- 
tiCRISPR sgFTCD_1 (Addgene 102312), pLentiCRISPR sgFTCD_2 (Addgene 
102313), pLentiCRISPR sgSLC19A1 (Addgene 102314), pLentiCRISPR sgHAL 
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(Addgene 102315), pLentiCRISPR ssAMDHD1 (Addgene 102316), pWP1_ 
mFTCD (Addgene 102317), pWP1_mHAL (Addgene 102318), pWP1_mAM- 
DHD1 (Addgene 102319), FUW-RLuc-T2A-PuroR (Addgene 102320). 
Patient survival curves. Patient phenotypic and RNA-seq data for HAL, FTCD, 
AMDHD1 and SLC19A1 from the Therapeutically Applicable Research to Generate 
Effective Treatments (TARGET) initiative managed by the NCI were downloaded 
from the UCSC Xena Genomics Browser (http://xena.ucsc.edu) and the DCC 
Open sections of the TARGET Data Matrix (https://ocg.cancer.gov/programs/ 
target/data-matrix) on August 15, 2017. Information about TARGET can be found 
at http://ocg.cancer.gov/programs/target. We then focused on a particular patient 
subset: the TARGET Substudy Acute Lymphoblastic Leukaemia (ALL) Expansion 
Phase 2 (phs000218.v18.p7), and further centred our analysis on patients treated 
with the COG AALL0232 high-risk ALL protocol (ClinicalTrials.gov Identifier 
NCT00075725). For patients with multiple represented tumour samples, only the 
first sample was used for downstream analyses. Overall survival was plotted for 
the upper and lower quartile of HAL, FTCD, AMDHD1 and SLC19A1 expression. 
Survival curves were compared by the log-rank test and P values indicated. 

Kaplan-Meier curves were plotted using Prism version 7.0b. End points for 
patients include death and dropout of patients for reasons other than death, in 
which cases survival data are integrated in the curve up to the last day of follow-up. 
The P value was calculated using a log-rank Mantel—-Cox test. 

We have complied with all relevant ethical regulations while conducting this 
patient-derived data collection. 
Reporting summary. Further information on experimental design is available in 
the Nature Research Reporting Summary linked to this paper. 
Data availability. The authors declare that all data generated or analysed during 
this study are included in this published version and its Supplementary Information 
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Files. Source Data are provided with the online version of the paper. All other 
datasets generated and/or analysed in the current study are available from the 
corresponding author on reasonable request. 
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Extended Data Fig. 1 | See next page for caption. 
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Extended Data Fig. 1 | Loss of FTCD decreases the sensitivity of cancer 
cells to methotrexate. a, Relative daily cell counts of the mixed culture 

of 42 genomically barcoded haematopoietic cancer cell lines. We focused 
our efforts on haematopoietic cell lines because methotrexate is most 
commonly used to treat haematopoietic malignancies’. Cells were co- 
cultured and treated with three concentrations of methotrexate 

(0.1, 0.5, or 51M). Cell counts are presented relative to the vehicle-treated 
co-culture. n = 3, biological replicates. Error bars indicate s.d. b, Sequencing 
results of the competitive co-culture experiment. Heat map of relative 
barcode abundance for each indicated cell line after treatment with 5 1M 
methotrexate or vehicle after 2, 4, or 6 days compared to those in the 
initial cultures. c, Validation of the results of the competitive co-culture 
experiment as shown by individual dose-response curves of seven 
relatively sensitive cell lines (MOLM13, EOL-1, SUPM2, DEL, HEL, 
NALM6 and SUDHL1), and six relatively resistant cell lines (HL-60, 
Ramos, HBL-1, Raji, SUDHL8 and Jeko-1). n = 2, biological replicates. 
Each biological replicate included technical triplicate. Error bars indicate 
s.d. d, Top 40 genes that scored in the genome-wide CRISPR-Cas9- 

based positive selection screen. These genes had the largest differential 
CRISPR scores between vehicle- and methotrexate-treated samples. The 
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heat map represents the CRISPR score of two biological replicates of 
each screen (vehicle- and methotrexate-treated). e, Abundance of each 
of the individual sgRNAs targeting FTCD in the screen. Read counts 

for each sgRNA is presented for each of the screen biological replicates 
of the vehicle-treated and methotrexate-treated samples. f, Fold change 
in the ECog values of methotrexate and doxorubicin in HEL cells stably 
expressing shRNA targeting either SLC19A1 or FTCD compared to the 
average of three non-targeting shRNAs (shGFP, shRFP and shLacZ). 

n= 2, biological replicates. Each biological replicate included technical 
triplicate. Error bars indicate s.e.m. g, Expression levels of SLC19A1 (left) 
and FTCD (right) in HEL cells stably expressing non-targeting shRNAs 
(shLacZ and shGFP) or targeting shRNAs (shSLC19A1 and shFTCD). 
Expression levels were measured by qPCR and normalized to the average 
of two control genes, UBC and HPRT1. n=3, technical replicates. Error 
bars indicate s.e.m. h, Validation of genetic depletion of FTCD by the 
CRISPR-Cas9 system in HEL, Ramos and LAMA84 cell lines. Expression 
of FTCD was measured by qPCR and normalized to the average of two 
control genes, UBC and HPRT1. n= 3 (except for HEL cells expressing 
sgFTCD_2+ mFTCD, then n =2 owing to loss of RNA from one sample), 
biological replicates. Error bars indicate s.d. 
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Extended Data Fig. 2 | FTCD depletion enables cancer cells to 

maintain THF pools and nucleotide synthesis even when treated with 
methotrexate (part 1). a, The histidine degradation pathway, as previously 
described'*!7-?. Enzymes are marked in blue. b, Metabolites detected by 
LC-MS and their corresponding retention times. The LC column used for 
the detection of each metabolite is indicated. c, Greater pool of 5-methyl 
THF in vehicle-treated HEL cells after FTCD depletion. It is not readily 
clear why, in vehicle-treated HEL cells, FTCD depletion caused a reduction 
in THF levels (Fig. 2d). However, 5-methyl THF levels in these cells 

were increased significantly (see also Extended Data Fig. 3c), indicating 
that there was no depletion in the overall recyclable amounts of THF in 
these cells. 5-Methyl THF levels were measured by LC-MS in vehicle- 
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treated HEL and Ramos cells. 5-Methyl THF levels were normalized to 
aminopterin as an internal standard. P values were calculated using one- 
way ANOVA. n = 3, biological replicates. Error bars indicate s.e.m. 

d, Chemical structures of the folate entities found in cells and mentioned 
in Fig. 2c. e, f, The labelling rate by [U-!?C]serine is not different between 
FTCD-depleted cells and control cells. Fractional labelling of glycine (e) 
and serine (f) is unchanged by FTCD depletion in HEL and Ramos cells 
in vehicle- and methotrexate-treated cells. Glycine and serine levels were 
normalized to isotopically labelled glutamate as an internal standard. 

P values were calculated for the unlabelled fraction by one-way ANOVA. 
n= 3, biological replicates. Error bars indicate s.e.m. UROC1, urocanate 
hydratase 1. 
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Extended Data Fig. 3 | FTCD depletion enables cancer cells to methotrexate-treated HEL and Ramos cells. 5,10-Methenyl THF levels 
maintain THF pools and nucleotide synthesis even when treated with were normalized to aminopterin as an internal standard. P values were 
methotrexate (part 2). a, b, Pool sizes of glycine (a) and serine (b) are not calculated by one-way ANOVA. n=3 biological replicates. Error bars 
significantly different between FTCD-depleted and control cells. Glycine indicate s.e.m. d, HEL cells were treated with 5|1M methotrexate for 
and serine levels were measured in vehicle- or methotrexate-treated HEL 48h, and Ramos cells were treated with 201M methotrexate for 72h 
and Ramos cells. P values were calculated by one-way ANOVA. n= 3, to decrease THF levels and nucleotide synthesis in wild-type cells. The 
biological replicates. Error bars indicate s.e.m. c, d, Greater cellular pool medium was then replaced with [U-!3C]serine-containing medium, 


of 5,10-methenyl THF in methotrexate-treated cells after FTCD depletion. _ and cells were incubated in [U-'°C]serine plus methotrexate for an 
[U-’C]serine labelling of 5,10-methenyl THF showed higher abundance additional 24h followed by cell collection and LC-MS analysis. The higher 


of unlabelled 5,10-methenyl THF in FTCD-depleted cells, compared to abundance of unlabelled 5,10-methenyl THF suggests higher availability 
control cells. This suggests higher availability of 5,10-methenyl THF in of reduced folate at the time of labelling onset. 5,10-Methenyl THF 

these cells at the time of labelling onset, which agrees with the higher levels were normalized to aminopterine as an internal standard. P values 
levels of both THF and 5,10-methenyl THF in the methotrexate-treated were calculated for the unlabelled fraction by one-way ANOVA. n=3, 
FTCD-depleted cells compared to the methotrexate-treated control cells. biological replicates. Error bars indicate s.e.m. 


c, 5,10-Methenyl THF levels were measured by LC-MS in vehicle- and 
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Extended Data Fig. 4 | The histidine degradation pathway affects 

the sensitivity of cancer cells to methotrexate and HAL expression 

is associated with treatment response in patients with ALL (part 1). 

a, Genetic depletion of the enzymes HAL and AMDHD1 decreased 
sensitivity to methotrexate, but not to a control drug, doxorubicin. Cell 
viability after treatment with varying concentrations of methotrexate 
and doxorubicin was used to calculate the ECgo values. n = 3, biological 
replicates. Error bars indicate s.e.m. b, Expression levels of AMDHD1 (left) 
and FTCD (right) were not significantly different across methotrexate- 
sensitive haematopoietic cell lines compared to methotrexate-resistant 
cell lines. The response to methotrexate was determined in a pooled 
fashion using genomically barcoded cell lines (Fig. 1a and Extended Data 
Fig. la—c). Expression levels of AMDHD1 and FTCD were measured 

by qPCR and normalized to the average of control genes (UBC and 
HPRT1). P values were calculated using the Kolmogorov-Smirnov test. 
n= 4 for resistant cell lines and n= 6 for sensitive cell lines (biologically 


independent samples). Each qPCR included three technical replicates. 
Centre bars indicate mean, error bars indicate s.e.m. c. Fractional labelling 
of glycine by [U-'°C]serine is unchanged by HAL depletion in HEL 

and Ramos cells in vehicle-and methotrexate-treated cells. d, Uptake 

of [U-°C]serine is higher in methotrexate-treated control cells but not 

in methotrexate-treated HAL-deficient cells. e, Glycine levels are not 
significantly different between HAL-deficient and control cells except for 
HEL cells treated with methotrexate. f, Serine levels are not significantly 
different between HAL-deficient and control cells except for HEL cells 
treated with methotrexate, in which HAL-deficient HEL cells have similar 
levels of serine to vehicle-treated cells. For c-f, glycine and serine levels 
were normalized to isotopically labelled glutamate as an internal standard. 
P values were calculated for the unlabelled fraction (c, d) or for total values 
(e, f) by one-way ANOVA. n = 3, biological replicates. Error bars indicate 
s.e.m. 
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Extended Data Fig. 5 | The histidine degradation pathway affects 

the sensitivity of cancer cells to methotrexate and HAL expression is 
associated with treatment response in patients with ALL (part 2). 

a, Validation of genetic depletion of HAL by the CRISPR-Cas9 system 

in NCIH1666 and EOL-1 cells. Expression levels of HAL were measured 
by qPCR and normalized to the average of two control genes, UBC and 
HPRT1. P values were calculated by two-way ANOVA. n=3 technical 
replicates. Error bars indicate s.d. b-d, Expression levels of SLC19A1 (b), 
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AMDHDI (c) or FTCD (d) do not predict survival of paediatric patients 
with ALL that are treated with a regimen that includes methotrexate. 
Kaplan-Meier curves of overall survival of patients with ALL with high 
(top quantile, coloured green) and low (bottom quantile, coloured blue)™* 
expression of each of the assayed genes. Patient sample size for each group 
is indicated. P values were calculated using the log-rank (Mantel-Cox) 
test. 
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Extended Data Fig. 6 | See next page for caption. 
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Extended Data Fig. 6 | In vivo histidine supplementation increases flux _ images) and after (bottom image) five days of treatment. All mice are 


through the histidine degradation pathway and sensitizes tumours presented. Mice are numbered in colour by their experimental group. For 
to methotrexate (part 1). a, In vivo imaging of luciferase-expressing b, d, SEM-cell-derived tumour-bearing mice: vehicle (n = 6), histidine 
HEL tumour xenografts. Mice were imaged before (top images) and supplementation (n =7), methotrexate (n = 7), methotrexate + histidine 
after (bottom image) five days of methotrexate treatment alone or in supplementation (n= 7). c. Additional images of H&E analyses of 
combination with histidine supplementation. For a, c, HEL-cell-derived HEL-cell-derived tumour sections from methotrexate-treated and 
tumour-bearing mice: vehicle (n =5), histidine supplementation (n = 4), methotrexate + histidine supplementation-treated mice. d. H&E images of 
methotrexate (n = 6), methotrexate + histidine supplementation (n =6). SEM-cell-derived tumour sections from all treatment groups. Three mice 
Four mice per group are presented. b, In vivo imaging of luciferase- per group are presented. 


expressing SEM tumour xenografts. Mice were imaged before (top 
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Extended Data Fig. 7 | In vivo histidine supplementation increases flux supplementation (n = 7). b, H&E analyses of kidney sections from 


through the histidine degradation pathway and sensitizes tumours to HEL-cell-derived tumour-bearing mice from all experimental groups. 
methotrexate (part 2). a, Higher magnification of H&E images oftumour —_ For HEL-cell-derived tumour-bearing mice: vehicle (n = 5), histidine 
sections from SEM-cell-derived tumour-bearing mice from all groups. supplementation (n = 4), methotrexate (n = 6), methotrexate + histidine 
For SEM-cell-derived tumour-bearing mice: vehicle (n =6), histidine supplementation (n =6). 


supplementation (n= 7), methotrexate (n =7), methotrexate + histidine 
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Extended Data Fig. 8 | In vivo histidine supplementation increases 
flux through the histidine degradation pathway and sensitizes 
tumours to methotrexate (part 3). THF levels decreased after 
methotrexate treatment and decreased even further when methotrexate 
was combined with histidine supplementation. Methotrexate levels were 
not different in tumours from mice treated with methotrexate alone or 
in combination with histidine supplementation. Nucleotide abundance 
was significantly lower in tumours from mice treated with methotrexate 
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and histidine supplementation compared to tumours from vehicle- 
treated mice. P values were calculated using non-parametric one-way 
ANOVA for all comparisons (Kruskal-Wallis test, Dunn’s post hoc 
test), except for methotrexate, for which P values were calculated by a 
two tailed t-test. Vehicle (n = 6), histidine (n = 6), methotrexate (n = 6), 
methotrexate + histidine (n =7). Centre bars indicate mean, error bars 
indicate s.e.m. 
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Extended Data Fig. 9 | See next page for caption. 
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Extended Data Fig. 9 | In vivo histidine supplementation increases flux 
through the histidine degradation pathway and sensitizes tumours to 
methotrexate (part 4). a, H&E analyses of liver sections from HEL- 
cell-derived tumour-bearing mice from all experimental groups. For 
HEL-cell-derived tumour-bearing mice: vehicle (n = 5), histidine 
supplementation (n = 4), methotrexate (n = 6), methotrexate + histidine 
supplementation (n = 6). b, Weight loss, presented as a percentage of 

the weight measured on the first experimental day, before treatment. 

For SEM-cell-derived tumour-bearing mice: vehicle (n = 6), histidine 
supplementation (n =7), methotrexate (n = 7), methotrexate + histidine 
supplementation (n = 7). P value was calculated using a non-parametric 
t-test (Mann-Whitney). c, d, Metabolites of the histidine degradation 
pathway were increased in HEL tumours from mice treated with histidine 
supplementation. Histidine (left) and FIGLU (right) levels were measured 
in tumours by LC-MS and normalized to isotopically labelled histidine 

as an internal standard. P values were calculated using a non-parametric 
t-test (Mann-Whitney). All metabolites measured in tumours were 


normalized to an average of four amino acids (phenylalanine, leucine, 
valine and tyrosine) as an internal loading control. For HEL-cell-derived 
tumour-bearing mice: vehicle (n= 5), histidine supplementation (n = 4), 
methotrexate (n = 5), methotrexate + histidine supplementation (n =6). 
For SEM- cell-derived tumour-bearing mice: vehicle (n = 6), histidine 
supplementation (n = 6), methotrexate (n = 6), methotrexate + histidine 
supplementation (n= 7). e, f, Plasma levels of methotrexate, histidine, 
5-methyl THF and folate. Methotrexate was detected in the plasma 

of methotrexate-treated mice only (left). No significant difference in 
histidine levels was detected (second from left). Levels of 5-methyl THF 
were significantly lower in the plasma of methotrexate-treated mice 
(second from right). Folate levels increased in the plasma of methotrexate- 
treated mice (right). Metabolite levels were measured from fresh plasma 
samples by LC-MS and normalized to isotopically labelled histidine or to 
aminopterin as an internal standard. P values were calculated using non- 
parametric one-way ANOVA. Group sizes are the same as in c, d. Centre 
bars indicate mean, error bars indicate s.e.m. 
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Extended Data Fig. 10 | See next page for caption. 
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Extended Data Fig. 10 | In vivo histidine supplementation sensitizes 
tumours to methotrexate without enhancement of treatment toxicity 
(part 1). a, We evaluated whether methotrexate treatment combined 
with histidine supplementation might be more toxic than methotrexate 
alone by setting up a longer treatment regime of 15 days with a recovery 
period of two weeks. During the experiment we monitored weight loss 
and observed no difference between mice treated with methotrexate 
alone and those treated with methotrexate and histidine. NOD-SCID 
mice were injected with HEL cells subcutaneously on day 1, followed 

by weight measurement every other day, in vivo imaging of HEL-cell- 
derived tumours (on days 7, 15, 20 and 28), treatment on days 12 to 23 
(vehicle, histidine and methotrexate injections every other day), and final 
termination of the experiment after 40 days, unless an early euthanization 
was required in accordance with the guidelines for humane experimental 
end-point of the Committee on Animal Care at MIT. The experiment 
included four experimental groups: vehicle-treated (saline) (n=7), 
histidine supplementation (n = 8), methotrexate-treated (n = 8), and 


histidine supplementation combined with methotrexate treatment (n = 8). 


Serum was collected at day 0 and day 23 for metabolite profiling and liver 
diagnostics. The dose of methotrexate used was 25 mg kg”! based on the 
weight measured at day 0. The dose of histidine was 18 mg per injection 
in 400 ul saline. b, A significant reduction in tumour size over time was 
found in mice treated with the combination of methotrexate and histidine 
supplementation. Tumours were imaged in vivo by luciferase expression 
at the indicated days. Fold changes in tumour sizes over measurements 
performed on day 7 are presented. P values were calculated by non- 
parametric one-way ANOVA. Group size changed over time owing to 
mouse euthanization for humane reasons. On day 15: vehicle-treated 
(n=5), histidine supplementation (n = 7), methotrexate-treated (n =7) 


and histidine supplementation combined with methotrexate treatment 
(n=8). On day 20: vehicle-treated (n = 6), histidine supplementation 
(n=6), methotrexate-treated (n = 7) and histidine supplementation 
combined with methotrexate treatment (n = 8). On day 28: vehicle-treated 
(n=6), histidine supplementation (n = 5), methotrexate-treated (n=7) 
and histidine supplementation combined with methotrexate treatment 
(n= 8). ¢, In vivo imaging of luciferase-expressing HEL-cell-derived 
tumours at days 7 (top) and 28 (bottom). Mice are labelled by number, 
with the colour of the label representing their experimental group. All 
mice assigned to the experiment are shown, group size is the same as 

in a. d, e, There was no increase in the abundance of serum markers 
indicative of kidney damage in mice treated with methotrexate + histidine 
supplementation compared to mice treated with methotrexate. Markers of 
kidney toxicity (urea and creatinine) were measured by LC-MS in serum 
samples of the tested mice. The relative abundance of urea and creatinine 
was normalized to isotopically labelled valine and tryptophan as an 
internal standard. P values were calculated using non-parametric one-way 
ANOVA. Group size: vehicle-treated (n = 7), histidine supplementation 
(n=7), methotrexate-treated (n= 8), and histidine supplementation 
combined with methotrexate treatment (n = 6). f, g, Some increase in liver- 
toxicity markers in mice treated with the combined therapy compared to 
mice treated with methotrexate alone. Markers of liver toxicity (alanine 
aminotransferase (ALT) and aspartate aminotransferase (AST)) were 
measured by an external serum diagnostics laboratory (IDEXX) in serum 
samples of the tested mice. Measurement units are indicated. P values were 
calculated using non-parametric one-way ANOVA. Group size: vehicle- 
treated (n = 6), histidine supplementation (n = 6), methotrexate-treated 
(n=6), and histidine supplementation combined with methotrexate 
treatment (n= 5). Centre bars indicate mean, error bars indicate s.e.m. 


© 2018 Springer Nature Limited. All rights reserved. 


LETTER 


a Kidneys from mice with HEL cell-derived tumor xenografts 
methotrexate + 


histidine supplementation 


vehicle 


histidine supplementation 


w= 


methotrexate 
ee 


oaks mo 
pi a 5 


scale bar = 25 um 


b Livers from mice with HEL cell-derived tumor xenografts 
vehicle 


methotrexate + 
histidine supplementation 
) 


histidine supplementation methotrexate 


scale bar = 100 um 


Extended Data Fig. 11 | In vivo histidine supplementation sensitizes at the conclusion of the experiment, after two weeks post-treatment 
tumours to methotrexate without enhancement of treatment toxicity recovery. Group sizes are the same as in Extended Data Fig. 9a. b, H&E 
(part 2). a, b, Histological analyses indicated that the kidney and liver analyses of liver sections from mice from all experimental groups. Group 
appeared normal at the end of the two-week recovery period. See sizes are the same as in Extended Data Fig. 9a. Tissues were collected at the 
also Supplementary Information Fig. 3a. a, H&E analyses of kidney conclusion of the experiment, after two weeks post-treatment recovery. 
sections from mice from all experimental groups. Tissues were collected See also Supplementary Information Fig. 3b. 
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Glucose-regulated phosphorylation of TET2 by 
AMPK reveals a pathway linking diabetes to cancer 


Di Wu!?”, DiHu!*”, Hao Chen!*’, Guoming Shi)’, Irfete S. Fetahu?, Feizhen Wu!, Kimberlie Rabidou’, Rui Fang’, Li Tan!, 
Shuyun Xu‘, Hang Liu!, Christian Argueta’, Lei Zhang?, Fei Mao®, Guoquan Yan‘, Jiajia Chen®, Zhaoru Dong’, Ruitu Lv!, 
Yufei Xu’, Mei Wang’, Yong Ye!, Shike Zhang’, Danielle Duquette’, Songmei Geng’, Clark Yin’, Christine Guo Lian‘, 

George F. Murphy‘, Gail K. Adler’, Rajesh Garg?, Lydia Lynch*, Pengyuan Yang°, Yiming Li®°, Fei Lan’, Jia Fan!, Yang Shi!? 


& Yujiang Geno Shi!* 


Diabetes is a complex metabolic syndrome that is characterized 
by prolonged high blood glucose levels and frequently associated 
with life-threatening complications”. Epidemiological studies 
have suggested that diabetes is also linked to an increased risk 
of cancer?*. High glucose levels may be a prevailing factor that 
contributes to the link between diabetes and cancer, but little is 
known about the molecular basis of this link and how the high 
glucose state may drive genetic and/or epigenetic alterations that 
result in a cancer phenotype. Here we show that hyperglycaemic 
conditions have an adverse effect on the DNA 5-hydroxymethylome. 
We identify the tumour suppressor TET2 as a substrate of the AMP- 
activated kinase (AMPK), which phosphorylates TET2 at serine 
99, thereby stabilizing the tumour suppressor. Increased glucose 
levels impede AMPK-mediated phosphorylation at serine 99, which 
results in the destabilization of TET2 followed by dysregulation of 
both 5-hydroxymethylcytosine (ShmC) and the tumour suppressive 
function of TET2 in vitro and in vivo. Treatment with the anti- 
diabetic drug metformin protects AMPK-mediated phosphorylation 
of serine 99, thereby increasing TET2 stability and 5hmC levels. 
These findings define a novel ‘phospho-switch’ that regulates TET2 
stability and a regulatory pathway that links glucose and AMPK 
to TET2 and 5hmC, which connects diabetes to cancer. Our data 
also unravel an epigenetic pathway by which metformin mediates 
tumour suppression. Thus, this study presents a new model for how 
a pernicious environment can directly reprogram the epigenome 
towards an oncogenic state, offering a potential strategy for cancer 
prevention and treatment. 

DNA methylation (5mC) and hydroxymethylation (ShmC) are 
epigenetic modifications that are frequently perturbed in cancer®*. 
The conversion of 5mC to 5hmC occurs through an oxidative reac- 
tion catalysed by the ten-eleven translocation (TET) protein family of 
dioxygenases (TET1, TET2 and TET3)*"!!. The reaction requires 
a-ketoglutarate, a metabolite that is influenced by the availability 
of glucose and glutamine!!%, This led us to predict that a hypergly- 
caemic state would increase the levels of 5hmC in blood cells from 
patients with diabetes. To test this possibility, we examined global 
5hmC in gDNA extracted from peripheral blood mononuclear cells 
(PBMC) collected from a cohort of healthy donors (haemoglobin Alc 
(HbA 1c) 5.5 +0.26%) and patients with diabetes (HbA 1c 10.7 + 1.9%, 
Fig. 1a). Unexpectedly, samples from patients showed a significant 
(P=0.0017) decrease in 5hmC levels compared to the healthy donors 
(Fig. 1b, Extended Data Fig. 1a), whereas 5mC levels remained the same 
(Fig. 1c). The presence of diabetes in the patient group was the strongest 
predictor of low 5hmC levels, with HbA 1c levels showing a significant 
(P <0.05) inverse correlation with 5hmC (Extended Data Table 1a). 


To investigate this inverse correlation, we cultured several cell lines 
under normal glucose (1 g 17!) and high glucose (4.5 g1~') conditions. 
A subset of these cell lines (PBMC, HUVEC and TF-1) exhibited signif- 
icantly (P=0.022 (PBMC), 0.046 (HUVEC), 0.047 (TF-1)) lower levels 
of 5hmC when subjected to high as opposed to normal glucose. The 
other cell lines (A375, A2058 and SK-MEL-5) did not show apparent 
changes in 5hmC levels between the two glucose conditions, as they 
have low baseline levels of 5amC (Extended Data Fig. 1b). 

Loss of 5hmC is an epigenetic hallmark of cancer, in which dimin- 
ished levels of TET2 expression have an important role!+!>. We 
hypothesized that the alterations in 5hmC in response to glucose were 
mediated through TET2, as TET1 and TET3 were barely detectable in 
these cells. Indeed, glucose-responsive cells (PBMC, HUVEC and TF-1) 
showed decreased TET2 in high glucose, whereas TET2 remained low 
and unaltered in the glucose-nonresponsive cells (A2058, A375 and 
SK-MEL-5) (Extended Data Fig. 1c). To unravel the specific role of 
TET2 in this modulation, we used an A2058-TET2WTI stable cell line, 
in which the expression of TET1 and TET3 is low and the expression of 
TET2 and 5hmC levels have been restored’ (Extended Data Fig. 1d, e). 
High glucose reduced the level of TET2 protein in TET2WT cells, 
whereas the TET2 mRNA level remained unchanged (Extended Data 
Fig. 1f, g). Notably, the protein half-life of TET2 was substantially 
lower under high glucose than under normal glucose (Extended Data 
Fig. 1h). Unlike native A2058 cells (Extended Data Fig. 1b), A2058- 
TET2WT cells showed a significant (P = 0.034) increase in 5hmC levels 
after 7 days of culture in normal glucose. This increase, however, 
could be reversed (within 24 h) by switching the culturing medium 
to high glucose (Fig. 1d, Extended Data Fig. li, j; see Supplementary 
Information for details). Such modulation was not observed in stable 
A2058 cell lines expressing mock control (mock), catalytically inactive 
full-length TET2 (TET2M), or catalytically active C-terminal TET2 
(TET2CD) (Fig. le, Extended Data Fig. 1k). These data suggest that 
functional, full-length TET2 is required to mediate the reversible 
changes in 5hmC in response to extracellular glucose availability. 

To identify gene-specific 54mC changes in response to glucose, we 
analysed genome-wide alterations in 5hmC in A2058-TET2WT cells 
using hydroxymethylated DNA immunoprecipitation coupled with 
deep sequencing (hMeDIP-seq) and methylated DNA immunopre- 
cipitation coupled with deep sequencing (MeDIP-seq). These analyses 
showed that ShmC was higher under normal glucose than high glucose, 
whereas there was no significant difference in 5mC (Fig. 1f, Extended 
Data Fig. 2a). We identified 30,217 differentially 5-hydroxymethylated 
regions (DhMRs), with more than 80% (about 24,537) of the regions 
being increased under normal glucose (Extended Data Fig. 2b, c). 
The majority of this DhMR enrichment (65.4%) was associated with 
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Fig. 1 | Hyperglycaemia deregulates global DNA 5hmC levels, and 

this regulation requires a functional, full-length TET2. a, Comparison 
of HbA Ic between 28 healthy donors and 29 patients with diabetes. 
ek PD — 2.88 x 10-15. Box plot: centre lines, median; limits, upper and 
lower quartiles; top and bottom whiskers, 10% and 90% percentiles; 
points, outliers. b, High-performance liquid chromatography with tandem 
mass spectrometry (HPLC-MS/MS) analysis of ratio of 5hmC to total 
Cin randomly selected gDNA from 15 healthy donors and 18 patients. 
48% D— 7 44 x 1079. c, HPLC-MS/MS analysis of ratio of 5mC to total C 
from the same samples as in b. d, Left, dot blot of 5amC in DNA extracted 
from A2058-TET2WT cell line cultured in high glucose (high-g) or 
normal glucose (normal-g), or switched from normal to high glucose 
(high-g*). Right, quantification of dot blots. *P = 0.034 (high-g), P= 0.026 
(high-g*). e, Dot blot comparison of 5hmC levels in A2058-TET2WT, 
mock (an A2058 cell line stably expressing empty vector), A2058-TET2M, 
and A2058-TET2CD cell lines cultured in high and normal glucose. 

f, Normalized hMeDIP-seq 5hmC tag density distribution across gene 
bodies in A2058-TET2WT cells cultured in high glucose (red) or normal 
glucose (blue). Each gene body was normalized relative to position 
percentage within the gene. g, hMeDIP-seq results of representative genes 
in which 5hmC is increased across the gene body in normal glucose. 
y-axis represents 5hmC density. h, Unguided clustering of genes that 

are differentially expressed when TET2 is present (in red frame) and 

not differentially expressed when TET2 is absent. Mut, A2058-TET2M. 

i, RT-qPCR validation of a subset of cancer-associated genes identified 

in h. Data represent three technical repeats in i and three biologically 
independent repeats in d, e. Two-sided Student's t-test, data shown as 
mean s.d., *P < 0.05, ****P< 0.0001. 


gene regions, with 9.56% localizing to promoters and 55.84% to gene 
bodies (Fig. 1g, Extended Data Fig. 2d). Notably, gene ontology and 
disease ontology analyses of these DhMR enriched genes showed strong 
associations with cancer and cancer-related pathways (Extended Data 
Fig. 2e). 

We next used microarrays to compare transcriptomes of mock, A2058- 
TET2M and A2058-TET2WT cells cultured in normal or high glucose 
(Fig. 1h). This analysis identified 585 genes that were differentially 
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Fig. 2 | TET2 is a substrate of AMPK, which specifically phosphorylates 
TET2 at S99. a, Conserved AMPK recognition motif of several key AMPK 
target enzymes, including TET2. b, Co-immunoprecipitation showing that 
Flag-TET2 interacts with the activated form of AMPK (AMPKpT172). 

c, Flowchart of the in vitro kinase assay with different detection 
approaches. d, LC-MS/MS profiling of the in vitro AMPK kinase assay 
showed that TET2S99 is specifically phosphorylated in the presence of 
active AMPK. e, Immunoblot validation of TET2p$99 phosphorylation 
using a TET2pS99-specific antibody. f, Determination of TET2 
phosphorylation at $99 via 3*P-autoradiograph (SLF is a naturally 
occurring mutant that probably disrupts the AMPK binding site). 

Data represent three biologically independent repeats in b, e, f. 


expressed in normal glucose versus high glucose when TET2 was 
present (A2058-TET2WT), but not in mock or A2058-TET2M cells 
(Fig. 1h, Extended Data Table 1b). These glucose-modulated 
and TET2-dependent genes are involved in cell cycle regulation and are 
associated with various cancers (Extended Data Fig. 3a, b). Among these 
genes, we found several tumour suppressor and tumour promoting 
genes that were downregulated and upregulated, respectively, under the 
high glucose condition, and validated a subset of these using quantitative 
PCR with reverse transcription (RT-qPCR) (Fig. 1i). Furthermore, 
we found that 213 of the 585 genes (36.4%) contained increased 
DhMRs in normal glucose. Gene ontology and disease ontology 
analysis revealed that these 213 genes were also highly associated 
with cell cycle regulation and various types of cancers (Extended Data 
Fig. 3c). Of note, these genes consist of both upregulated and down- 
regulated genes (Extended Data Fig. 3d), suggesting that ShmC has a 
complex role in gene regulation, as previously reported!®””. 

AMPK isa key nutrient or energy sensor that is highly sensitive to 
glucose availability'’. TET2 contains two putative AMPK motifs cen- 
tred on serines 99 (S99) and 1205 ($1205) (Fig. 2a). Furthermore, the 
activated form of AMPK (pAMPK172) was co-immunoprecipitated 
with TET2 (Fig. 2b). To show that TET2 is a substrate of AMPK, we 
carried out an in vitro AMPK kinase assay (Fig. 2c, Extended Data 
Fig. 4a). By liquid chromatography with tandam mass spectrometry 
(LC-MS/MS) analysis, we detected robust phosphorylation of TET2, 
specifically at S99, by the active form of AMPK (Fig. 2d, Extended Data 
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Fig. 3 | Glucose regulates TET2 stability via the TET2pS99 
phosphorylation switch, which is controlled by AMPK. a, Left, 
immunoblot signal of phosphoserine 99 (TET2pS99) on Flag-TET2WT 
immunoprecipitated from cells cultured in high or normal glucose. 
Right, quantification of TET2pS99 signals normalized by total TET2, 
*P=0.027. b, LC-MS/MS validation of pS99 levels in the cells under 
high and normal glucose, *P = 0.030. c, Normalized TET2, TET2pS99 
and pAMPK levels in PBMCs from healthy donors and patients with 
diabetes; n =6, ****P=1.01 x 10-6 (TET2), **P=0.0035 (TET2p$99), 
P= 5.26 x 10-4 (pAMPR). d, Flag-TET2WT half-life and pAMPK 
levels under normal glucose or normal glucose switched to high glucose. 
e, Left, dot blot showing 5hmC levels in A2058-TET2WT cells after 


Fig. 4b). There was little change to the other putative phosphorylation 
sites, including $1205. We next generated an antibody that specifically 
recognizes the phosphorylated form of TET2 S99 (Extended Data 
Fig. 4c). Immunoblotting confirmed the specificity of the antibody, as 
it gave a strong signal when used against AMPK-treated TET2WT, but 
not with untreated TET2WT or TET2 with a serine-to-alanine muta- 
tion at position 99 (TET2S99A) (Fig. 2e). Using a 32p radiolabelling 
kinase assay, we observed a marked decrease in y-ATP incorporation 
in the TET2S99A mutant as well as in the S102_L103 >F (SLF) mutant 
(http://cancer.sanger.ac.uk/cosmic). By contrast, TET2WT and the 
$1205A mutant demonstrated comparably high levels of radioactivity 
(Fig. 2f, Extended Data Fig. 4d). These findings reveal that TET2 is 
a substrate of AMPK and that $99 is the major phosphorylation site 
catalysed by AMPK. 

AMPK is known to be activated upon energy stress induced by 
glucose depletion’®”°. We hypothesized that glucose-mediated TET2 
protein stability was modulated through phosphorylation of TET2S99 
by AMPK. To test this hypothesis, we first compared TET2S99 
phosphorylation levels in A2058-TET2WT cells cultured in normal 
glucose versus high glucose. Both western blot (P=0.027) and HPLC-MS 
analysis (P= 0.030) revealed that the phosphorylation status of 
TET2S99 (TET2pS99) was significantly higher in normal as compared 
to high glucose (Fig. 3a, b, Extended Data Fig. 5a). Consistent with 
the kinetics of 5amC shown in Extended Data Fig. li, j, we observed 
steady increases in pAMPK, TET2pS99 and TET2 during days 0, 2 
and 4 after switching from high to normal glucose (Extended Data 
Fig. 5b). These increases were confirmed in other cell lines as well as in 
PBMCs (Extended Data Fig. 5c). Notably, pAMPK, TET2pS99 and total 
TET2 were significantly (P= 1.01 x 10~° (TET2), 0.0035 (TET2p$99), 
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treatment with 100 1M A769662 (an AMPK activator). Right, normalized 
quantification of dot blot signals; *P = 0.029. f, CHX treatment to measure 
the half-life of Flag-TET2WT in cells treated with (+) or without (—) 
metformin (Met; 5 mM). g, Effects of knocking down AMPKa2 on the 
protein stabilities of Flag-TET2WT and TET2S99A. Scr, scrambled 
shRNA. h, Half-lives of Flag-TET2S99D and TET2S99A. i, HPLC-MS/MS 
results showing the effects of knocking down AMPKa2 on 5hmC levels in 
TET2WT (**P=0.006 (sh1), 0.003 (sh2)) cells and cells overexpressing 
TET2S99A or TET2S99D. Data represent three biologically independent 
repeats in a, b, d-i. Two-sided Student's t-test, data shown as mean + s.d., 
*P<0.05, **P < 0.01, ***P < 0.001, ****P < 0.0001. For gel source data, 
see Supplementary Fig. 1. 


5.26 x 10-4 (pAMPK)) higher in PBMCs from healthy donors than 
in those from patients with diabetes (Fig. 3c, Extended Data Fig. 5d). 
This is consistent with our observation that 5hmC is increased in the 
healthy group (Fig. la, b). Collectively, these data demonstrate that 
TET2 is phosphorylated at S99 by AMPK, which is suppressed under 
hyperglycaemic conditions. 

To investigate the pivotal role of AMPK-mediated phosphoryla- 
tion in TET2 stability and 5hmC levels, we measured the half-life of 
TET2 and corresponding AMPK activity in cells that were initially 
cultured in normal glucose and were then switched to high glucose or 
kept in normal glucose. As indicated by pAMPK (pThr172), AMPK 
retained phosphorylation activity in cells that were continually 
cultured in normal glucose, whereas it was deactivated in cells switched 
to high glucose. Concomitantly, we observed that the half-life of 
TET2 protein was substantially reduced in cells that were switched to 
high glucose (Fig. 3d). Next, we treated cells with AMPK activators 
(metformin or A769662)”), As predicted, the activators elevated the 
levels of pAMPK, TET2pS99, total TET2 (Extended Data Fig. 6a, b) 
and 5hmC (Fig. 3e). When cells cultured in high glucose were treated 
with metformin or A769662, the half-life of TET2 increased notably 
(Fig. 3f, Extended Data Fig. 6c, d). Furthermore, metformin 
treatment also increased 5hmC in a subset of DhMRs previously 
identified to be increased under normal glucose (Extended Data 
Fig. 6e). To demonstrate that AMPK directly regulates TET2 protein 
stability, we used short hairpin RNAs (shRNAs) to specifically knock- 
down AMPKa2, an AMPKcaisoform enriched in the nucleus”, in A2058 
cells expressing TET2WT or TET2S99A, the non-phosphorylatable 
mutant. AMPKa2 shRNAs effectively depleted AMPKa2 and resulted 
in a marked decrease in TET2WT protein but not in the S99A mutant 
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Fig. 4 | Hyperglycaemia promotes xenografted tumour growth, 

which is suppressed by TET2 and further prevented by metformin. 

a, b, Representative images of tumour-bearing diabetic (a) and non- 
diabetic (b) nude mice xenografted with A2058-TET2WT (left) or mock 
tumour cells (right) treated with (++) or without (—) metformin. 

c, d, Quantitative comparison of tumour size and efficacy of metformin’s 
tumour suppressive function on A2058-TET2WT and mock tumours in 
diabetic (c) and non-diabetic (d) nude mice. c, n=5, ***P=8.3 x 10-4 
(TET2 versus mock), *P=0.048 (TET2 versus TET2 + Met). 

d,n=4-5, ****P—5.2 x 10-° (TET2 versus mock), 0.0046 (TET2 versus 
TET2 + Met). Two-sided Student’s t-test, data shown as mean +s.e.m. 
*P<0.05, **P< 0.01, ***P < 0.001, ****P < 0.0001; NS, not significant. 


(Fig. 3g). These data suggest that AMPK and TET2S99 phosphorylation 
are important for protecting TET2 stability. 

To identify the role of S99 phosphorylation, we generated another 
A2058 stable cell line expressing a phospho-mimic mutant, TET2S99D 
(Extended Data Fig. 7a, b).When treated with cycloheximide (CHX), 
the TET2S99A mutant exhibited less stability than the TET2S99D 
mutant (Fig. 3h). Consistent with greater protein stability, the 
TET2S99D cells had higher ShmC than TET2S99A cells in high glucose 
(Fig. 3i). After knocking down AMPK, we observed a marked drop in 
5hmC levels in wild-type cells, with no such effect in the TET2S99A 
or S99D cells (Fig. 3i, Extended Data Fig. 7c). 

Previous studies have shown that the stability of TET2 protein is 
regulated by calpain family proteases”, and phosphorylation can protect 
proteins from calpain cleavage”. Thus, we hypothesized that phos- 
phorylation of $99 protects TET2 from calpain-mediated degradation. 
Indeed, treatment with a calpain inhibitor strongly stabilized TET2S99A 
and increased the protein level 3.7-fold, whereas only marginal 
effects were seen with TET2WT and TET2S99D (Extended Data 
Fig. 7d). Together, these results indicate that TET2pS99 is a ‘phospho- 
switch that is regulated by AMPK and is critical for the regulation of 
the stability of TET2 and levels of 5hmC. 

To investigate how this AMPK-TET2 axis translates sustained high 
glucose exposure into a cancer-prone phenotype, we investigated 
the effects of TET2 and glucose levels on cell proliferation. A2058- 
TET2WT cells showed significantly (P= 5.68 x 107°) higher cell 
proliferation rates in high glucose than in normal glucose, whereas 
mock cells showed little effect (Extended Data Fig. 8a). Notably, this 
TET2-dependent and glucose-influenced growth phenotype was 
validated in an independent model, TF-1 cells? (Extended Data 
Fig. 8b-d). These data suggest that TET2 has an important role in 
glucose-modulated tumour cell growth. 

There is growing evidence that metformin, a widely used anti- 
diabetic drug, is also a potential anti-cancer agent”””*. We inves- 
tigated whether the effect of metformin on cell proliferation also 
involves the AMPK-TET2-5hmC axis. We treated A2058-TET2WT, 
A2058-TET2S99A and mock cells with metformin and detected 
a significant (P= 0.026) increase in 5hmC levels in TET2WT cells, 
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but not in TET2S99A or mock cells (Extended Data Fig. 8e). Using 
MTS (3-(4,5-dimethylthiazol-2-yl)-5-(3-carboxymethoxypheny]l)-2- 
(4-sulfophenyl)-2H-tetrazolium) proliferation assays, we examined the 
growth of cells treated with metformin and observed that metformin 
significantly (P=7.18 x 10~*) reduced the proliferation of TETZ2WT 
cells, but had no effect on TET2S99A or mock cells (Extended Data 
Fig. 8f). 

Next, we analysed the anchorage-independent growth of TET2WT, 
TET2S99A and mock cells. Restoring TET2 expression reduced 
anchorage-independent growth of A2058 cells. However, this 
suppression was compromised in A2058 cells expressing TET2S99A 
(Extended Data Fig. 8g, h, 0 mM). This phenomenon could be 
recapitulated in an independent cell line, MDA-MB-231 (Extended 
Data Fig. 8i-l). Notably, metformin further reduced colony growth 
in a dose-dependent manner, adding another layer of suppression to 
A2058-TET2WT cells. By contrast, the TET2S99A mutant and mock 
cells did not show significant growth inhibition upon metformin treat- 
ment (Extended Data Fig. 8g, h, 2-4 mM). Collectively, these results 
demonstrate in vitro that metformin requires the AMPK-TET2-5hmC 
axis to execute its anti-tumour effects. 

To validate our hypothesis in vivo, we first generated BALB/c nude 
mice xenografted with TET2WT or mock A2058 tumours under 
both diabetic and non-diabetic conditions, as outlined in Extended 
Data Fig. 9a, b. In support of the role of TET2 as a tumour suppressor, 
TET2WT tumours were significantly smaller than those of the mock 
control in both diabetic (Fig. 4a, c; P=8.3 x 10~*) and non-diabetic 
mice (Fig. 4b, d; P=5.2 x 10~°). Notably, TET2WT tumours in 
diabetic mice were significantly larger than those in non-diabetic mice 
(Extended Data Fig. 9e, f; P= 0.026 without metformin treatment, 
P=0.0023 with metformin treatment). By contrast, there was little 
difference between mock tumours grown in diabetic or non-diabetic 
mice. These results are consistent with a mechanism in which a 
sustained diabetic or hyperglycaemic environment impairs TET2- 
5hmC-mediated tumour suppression. 

Next, we investigated whether we could observe the anti-tumour 
effects of metformin, also operating through the AMPK-TET2-5hmC 
axis, in vivo. We found that metformin imposed an additional layer of 
suppression on TET2WT tumours in both diabetic and non-diabetic 
mice (Extended Data Fig. 9c, d). By contrast, mock tumours showed 
little, if any, difference in size under the same treatments. Notably, when 
we depleted TET2 expression in A2058-TET2WT cells, the tumours 
behaved similarly to mock tumours and were no longer suppressed by 
metformin (Extended Data Fig. 9g, h). We should note that metformin 
did not change blood glucose levels, consistent with a previous report” 
(Extended Data Table 1c). Nevertheless, immunohistochemical (IHC) 
staining for pAMPK showed that metformin treatment effectively 
increased pAMPK in both TET2WT and mock tumours (Extended 
Data Fig. 10a, c). The levels of 5amC, however, were increased only in 
TET2WT tumours treated with metformin (Extended Data Fig. 10b, d). 
These data demonstrate in vivo that the anti-tumour effect of met- 
formin requires a functional TET2 protein and acts downstream of the 
influence of glucose upon the AMPK-TET2-5hmC axis. 

In summary, we have shown that sustained hyperglycaemia 
destabilizes the tumour suppressor TET2 and deregulates levels 
of 5hmC. We describe an environment-to-epigenome signalling 
pathway, the glucose-AMPK-TET2-5hmC axis, which links the level 
of extracellular glucose to the dynamic regulation of 54mC—and, 
ultimately, diabetes to cancer. We have identified AMPK-mediated 
TET2 phosphorylation at S99 as a molecular switch that controls a 
pivotal step in the glucose-AMPK-TET2-5hmC axis. Disabling 
this switch causes calpain-mediated degradation of TET2, resulting 
in a dysregulated hydroxymethylome and transcriptome. Notably, 
the anti-tumour effect of metformin requires a functional AMPK- 
TET2-5hmC axis. We propose that the glucose-AMPK-TET2-5hmC 
axis, which may work separately or in conjunction with other cellular 
pathways such as those involving mTOR”, is a signalling pathway by 
which cancer-promoting environmental cues are directly linked to 
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the reprogramming of a cancer-favourable epigenome. This epige- 
netic regulation may have a direct effect on the efficacy of metformin 
in preventing cancer, thus providing novel avenues for future clinical 
investigation. 
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Extended Data Fig. 1 | See next page for caption. 
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Extended Data Fig. 1 | Biochemical and molecular analysis of 5amC 
and TET2 in response to glucose in primary blood cells and cultured 
cell lines. a, Dot blot comparison of global 5hmC levels between PBMC 


gDNA from 28 healthy donors and 29 patients with diabetes, **P = 0.0017. 


b, Genomic DNA extracted from several cell types cultured under high 
or normal glucose were dot blotted for 5amC, *P =0.022 (PBMC), 0.046 
(HUVEC), 0.047 (TF-1). c, Western blot revealed that endogenous TET2 
protein levels in PBMCs, HUVECs and TF-1 cells were higher when 
cultured in normal glucose than in high glucose. TET1 and TET3 protein 
levels were minimally detectable in these cell lines. d, e, RT-qPCR (d) 
and western blot (e) showed that TET2 dominated amongst the TET 
family in A2058-TET2WT cells. In A2058 cells, expression of all three 
genes was extremely low. f, Flag-TET2 protein levels in whole-cell lysates 
from A2058-TET2WT cells cultured in high glucose or normal glucose. 
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g, Comparable levels of Flag-TET2 mRNA from A2058-TET2WT cells 
cultured in high glucose or normal glucose. h, Half-lives of Flag-TET2 in 
A2058-TET2WT cells cultured in high glucose or normal glucose. i, High 
glucose shock resulted in an acute increase in 5hmC in A2058-TET2WT 
cells followed by a rapid drop to baseline. j, Long-term culturing of 
A2058-TET2WT cells in normal glucose resulted in a sustained increase 
in 5hmC, which could be reversed by switching the medium to high 
glucose. See Supplementary Information for more details. k, Dot blot 
quantification of 5hmC levels in A2058-TET2WT, mock, A2058-TET2M, 
and A2058-TET2CD cell lines in Fig. le; **P = 0.0068. Data in b-k 
represent three biologically independent repeats each. Two-sided Student’s 
t-test, data shown as mean +s.d. *P < 0.05, **P < 0.01. For gel source data, 
see Supplementary Fig. 1. 
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normalized relative to position percentage within the gene. In contrast 
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b, Total peak numbers for differentially 5-hydroxymethylated regions 
(DhMRs) when comparing normal glucose to high glucose. We identified 
30,217 DhMRs in total, among which the majority (>80%, ~24,537) were 
increased in normal glucose, whereas <20% were decreased. c, hMeDIP- 
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Extended Data Fig. 3 | Gene ontology and functional disease ontology 
analysis of genes that were transcriptionally regulated by glucose and 
TET2. a, The 585 glucose-modulated and TET2-dependent genes (Fig. 1h) 
identified in A2058-TET2WT cells were analysed by gene ontology 

(left) and disease ontology (right). These genes are enriched in cell cycle 
pathways and are strongly correlated with cancers, including carcinoma, 
colon cancer, lung cancer and embryoma. b, Detailed disease ontology 
clustering of the 585 genes showed strong association with various cancer 
types. c, Two hundred and thirteen out of the 585 genes also had increased 


1 2 3 4 5 
5hmC fold change 


DhMRs in normal glucose as compared to high glucose. Further analysis 
of this gene subset by gene ontology and disease ontology showed that 
these genes are also highly associated with cell cycle functions and strongly 
correlated with cancers. d, Of these 213 genes with increased 5hmC 

and altered gene expression in response to glucose alterations, 124 were 
upregulated and 89 were downregulated. Consistent with the intricate role 
of 5hmC in gene regulation, correlation analysis revealed no significant 
relationship between increased 5hmC and gene upregulation. 
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Extended Data Fig. 4 | Purification of full-length TET2 recombinant 
proteins and determination of TET2 S99 phosphorylation by AMPK. 
a, SDS-PAGE gel showing the quality and quantity of recombinant 
TET2WT, TET2S99A, TET2SLF and TET2S1205A proteins. 

b, Representation of TET2 S99 phosphorylation levels, as detected by 
LC-MS/MS after in vitro kinase assay. The peak areas represent the 
abundance of peptides containing non-phosphorylated S99 (lower lane) 
or phosphorylated $99 (upper lane). The levels of $99 phosphorylation 
were much higher after the addition of AMPK in the in vitro kinase assay. 
c, Various amounts of unphosphorylated or phosphorylated TET2S99 
peptides were spotted onto nitrocellulose membrane and detected by 
purified anti-TET2 pS99 antibody (right) or unpurified whole serum 
(left). The TET2pS99 antibody specifically recognized phosphorylated S99 
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peptides. Figures in a—c represent three biologically independent repeats 
each. d, Quantification of the radioactivity of AMPK phosphorylation. 

At the end of the phosphorylation assay, 20 1g of full-length Flag~TET2 
WT and mutant proteins bound to beads were subjected to measurements 
by Liquid Scintillation Analyzer. The calculation is based on the specific 
activity of [\-**P] ATP (800 c.p.m. per pmol) used and the calculated 
molar amounts of TET2 added to the assay tube. The stoichiometry is 
around 0.38 mol phosphates per mole of TET2WT. This number decreases 
markedly if the TET2 protein is mutated (S99A mutant or SLF mutant, 
both of which disrupt the AMPK-recognizing consensus sequence). 
TET2WT and the S1205A mutant had comparably high levels of 
phosphorylation. n= 3 biologically independent repeats, with data shown 
as mean + s.d. 
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Extended Data Fig. 5 | Increased AMPK activation, TET2 
phosphorylation and TET2 stability in cells cultured with normal 
glucose. a, Representations of TET2 $99 phosphorylation levels detected 
by LC-MS/MS in A2058-TET2WT cells cultured under high (left) or 
normal glucose (right). Flag-TET2 from A2058-TET2WT cells cultured 
under normal or high glucose was purified and then subjected to 
LC-MS/MS analysis. The peak areas represent the abundance of peptides 
that contained non-phosphorylated $99 (top) or phosphorylated 

$99 (bottom). The ratio was calculated as area(phosphorylated S99)/ 
(area(phosphorylated $99) + area(non-phosphorylated S99)), which 
indicated the level of $99 phosphorylation. $99 phosphorylation 
(TET2pS99) in cells cultured in normal glucose was significantly higher 
than the phosphorylation levels observed in cells cultured under high 
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glucose. Data are representative of three biologically independent repeats 
in each condition. b, A2058-TET2WT cells were switched from high to 
normal glucose for 0, 2 or 4 days. Results showed a steady increase in 

the levels of pAMPK as well as TET2pS99 and Flag-TET2 during this 
period. Data are representative of three biologically independent repeats. 
c, Western blot showing increased levels of TET2pS99 and pAMPK in cell 
lines (PBMC, HUVEC, TF-1) cultured under normal glucose versus high 
glucose. Data are representative of three biologically independent repeats. 
d, Western blot comparison of TET2, TET2pS99, and pAMPK levels 
between PBMCs from healthy donors and from patients with diabetes. 
TET2, TET2pS99 and pAMPK levels were significantly higher in healthy 
donors than in patients. Data are representative of three western blot 
repeats. 
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Extended Data Fig. 6 | Effects of AMPK activators (A769662 and 
metformin) on TET2 protein stability and 5hmC levels. a, A2058- 
TET2WT cells were treated with metformin and then immunoblotted 
with the indicated antibodies. Metformin treatment increased the levels of 
pAMPK, TET2pS99 and Flag-TET2. b, A2058-TET2WT cells were treated 
with 2 11M, 10 1M or 100 1M A769662 (an AMPK activator) for 30 min 

or 1 h, and then immunoblotted with the indicated antibodies. A769662 
treatment increased pAMPK, TET2p$99 and Flag-TET2 levels. 

c, Quantification of normalized Flag-TET2WT half-life in A2058- 
TET2WT cells treated with CHX and with (+) or without (—) metformin 
(5 mM) in Fig. 3f. Metformin treatment increased the half-life of TET2. 


d, Left, CHX was used to measure the half-life of Flag-TET2 in A2058- 
TET2WT cells treated with (+) or without (—) A769662 (100 1M) in high 
glucose. Right, quantification of normalized Flag-TET2WT treated with 
CHX and with (+) or without (—) A769662. A769662 increased TET2 
stability from 42% to 73% at 4 h after CHX treatment. Data in a—d are 
representative of three biologically independent repeats each. e, hMeDIP- 
qPCR showed that metformin treatment increased DhMRs to similar 
levels as observed in normal glucose in the previously validated genes in 
Extended Data Fig. 2c. n= 3 biologically independent repeats, data shown 
as mean + s.d. 
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Extended Data Fig. 7 | Effects of AMPK knockdown on 5hmC in cells 
expressing TET2WT, TET2S99A or TET2S99D. a, b, Western blot (a) 
and RT-qPCR (b) to measure TET2 protein and mRNA levels in cells 
expressing TET2WT, TET2S99A or TET2S99D. TET2S99A has the lowest 
protein level (a), but it has the highest mRNA level (b) amongst the three 
cell lines. c, Representations of HPLC-MS/MS analysis of genomic DNA 
from A2058 cells expressing TET2WT, TET2S99A or TET2S99D with or 
without AMPK knockdown (AMPK KD) in Fig. 3i. Left, chromatograms 
of 5hmC in cells expressing TET2WT or mutants with or without AMPK 
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Extended Data Fig. 8 | See next page for caption. 
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Extended Data Fig. 8 | In vitro characterization of the effects of glucose 
and the AMPK-TET2 axis on tumour suppression. a. MTS proliferation 
assay shows that A2058-TET2WT cells grew significantly more slowly 
under normal glucose than under high glucose, but mock cells showed 

no difference; n= 5 biologically independent repeats, P= 5.68 x 107°. 

b, TET2 was effectively knocked down (KD) in TF-1 cells, as confirmed 
by western blot (left) and RT-qPCR (right). c, TF-1 TET2 KD cells 
exhibited cytokine-independent growth under no cytokines, consistent 
with a previous report”. d, Proliferation of TF-1 shControl and shTET2 
cells treated with high or normal glucose under the indicated cytokine 
concentrations. TF-1 shControl proliferation was slower under the lower 
glucose condition, but TF-1 shTET2 cells showed no differential growth 
rates. P values (from left to right) are 0.0078, 1.94 x 107%, 3.21 x 10-4, 
0.0011, and 0.0091. a and d suggest that TET2 is a crucial point through 
which glucose levels affect tumour growth rates. n= 3 biologically 
independent repeats in b-d. e, Left, dot blot showing 5hmC levels 

in A2058-TET2WT, A2058-TET2S99A and mock cells with (+) and 
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without (—) metformin (5 mM) treatment. Right, normalization of n=5 
biologically independent dot blot repeats; P= 0.026. f, MTS proliferation 
assay of A2058 cells expressing TET2WT, TET2S99A or mock treated with 
(+) or without (—) metformin (5 mM); n=5 biologically independent 
repeats, P=7.18 x 10~°. g, h, Soft agar growth of A2058-TET2WT, 
A2058-TET2S99A and mock cells treated with 0 mM, 2 mM or 4 mM 
metformin (g) and respective colony counting (h). P=6.99 x 107° (top), 
0.004 (middle), 0.0013 (bottom). i, j, mRNA (i) and protein (j) levels in 
MDA-MB-231 cells stably expressing mock, TET2WT or TET2S99A. 

k, 1, Soft agar growth (k) and quantification (1) of MDA-MB-231 cells 
stably expressing mock, TET2WT or TET2S99A; n = 3 biologically 
independent repeats. The data suggest that TET2WT suppresses 
anchorage-independent tumour cell growth in vitro. Prevention of TET2 
phosphorylation at $99 results in loss of this TET2-mediated suppression. 
Two-sided Student’s t-test, data shown as mean + s.d. *P < 0.05, 

**P < 0.01, ****P < 0.0001. 


© 2018 Springer Nature Limited. All rights reserved. 


LETTER 


a 
Non-Diabetic Diabetic 
Tumor type TET2 TET2 Mock Mock TET2 TET2 Mock Mock 
Metformin - + = + = + - 
b 


= 


STZinduction 
Day1 


c Diab TET2 and Mock 
600 4 ~@ Diab TET2WT 
500 4 <= Diab TET2WT + Met { 


aon -a- Diab Mock ce 


100 


= Diab Mock + Met = 


Tumor size (mm?) 


Days post inoculation 


Diab vs. non-diab 
KK 


Tumor size (mm?) 


OQ 
g # 
é 


Extended Data Fig. 9 | See next page for caption. 
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Extended Data Fig. 9 | The mouse study paradigm and examination of 
the regulatory function of the tumour-suppressive glucose-AMPK- 
TET2 axis in vivo. a, Division of mice into eight groups. b, Outline of 
experimental procedure. In brief, nude mice were first induced to develop 
diabetes, and then transplanted with designated tumour cells. Tumour 
formation and sizes were documented continuously for three weeks, 
followed by histological and pathological examination. See more details 
in Supplementary Information. c, d, Growth curves of A2058-TET2WT 
and A2058 mock tumours in diabetic (c) and non-diabetic (d) nude mice, 
with and without metformin treatment. n =5, P=0.048 in c; n= 4-5, 
P=0.0046 in d. e, f, Comparison between endpoint A2058-TET2WT 
and mock tumour sizes in diabetic and non-diabetic mice. Mice were 
treated either with (f) or without (e) metformin, n = 4-5. Tumours from 
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diabetic TET2 groups were significantly larger than that from non- 
diabetic TET2 groups in both e and f. However, mock groups showed no 
difference between diabetic or non-diabetic conditions in either e or f. 
*P = (0.026 (bottom), ***P = 0.00043 (top) in e; **P = 0.0023 (bottom), 
*** P — (00013 (top) in f. g, Western blot showed successful TET2 knock 
down (A2058-TET2KD cells) in comparison with its TET2WT precursor. 
Data are representative of three biologically independent repeats. 

h, Growth curves of A2058-TET2WT and A2058-TET2KD tumours 
with and without metformin treatment, n = 4-5. The curve indicates 

that A2058-TET2KD tumours were no longer suppressed by metformin, 
and grew larger than A2058-TET2WT tumours; *P = 0.031. Two-sided 
Student’s t-test, data shown as mean +s.e.m. *P < 0.05, **P< 0.01, 

***P < < 0.001, ****P < 0.0001; ns, not significant. 
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Extended Data Fig. 10 | See next page for caption. 
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Extended Data Fig. 10 | 5hmC and pAMPK levels in tumour 
xenografts after metformin treatment in diabetic and non-diabetic 
mice. a, Representative histology of pAMPK IHC staining in mock and 
A2058-TET2WT tumours treated with (++) or without (—) metformin 
(200 x); pictures represent four different stained slides in each group. 
b, Representative histology of 5hamC IHC staining in mock and A2058- 
TET2WT tumours treated with (+) or without (—) metformin (200 x); 
pictures represent four different stained slides in each group. All slides 
were counterstained with haematoxylin (light blue). c, d, Statistical 
analyses of the average score of pAMPK and 5hmC staining in mock and 
A2058-TET2WT tumours treated with or without metformin. Counts 
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were done on 10 random fields in each group. Metformin treatment 
increased pAMPK in both mock and A2058-TET2WT tumours 
xenografted into diabetic and non-diabetic mice; ***P=6.06 x 10-6 

(1), **P= 0.0013 (2), ****P= 1.04 x 107! (3), ****P = 3.53 x 10° (4). 
However, metformin increased 5hmC only in A2058-TET2WT tumours 
in diabetic and non-diabetic mice, with no increase in 5hmC in mock 
tumours under metformin treatment; **P = 0.0010 (5), **P=0.028 (6). 
Box plot: centre lines, median; limits, upper and lower quartiles; top and 
bottom whiskers, minima and maxima. Two-sided Student’s t-test, data 
shown as mean +s.d. *P< 0.05, **P< 0.01, ***P< 0.001, ****P< 0.0001. 
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Extended Data Table 1 | Correlation data analysis of 5hmC in diabetes, glucose/TET2 co-regulated genes and metformin impact on blood 
glucose 


a Correlation between 5hmC and various health parameters. 
Health Parameters Beta Coefficient P-value 
Presence of diabetes -13.51 0.0001 
BMI -0.8 <0.0001 
Age -0.51 <0.0001 
Systolic blood pressure -0.33 0.007 
HbA1C -0.23 <0.05 
b 
CYB561D1 FDX1 FAM195A_ —TACC3 NUP8S HSPH1 HAUS7 IMPDH1 ZNF408 SLC25A19 
GTSE1 RPL13P5 cDC73 PLEKHB2 SLC25A15 FOXM1 ESF1 NUDT19 NOP2 NCAPH 
LsM10 REXO4 MCM7 NR3C1 PDCD7 C12orf44 — TCOF1 WASF1 LyPD1 PAN3 
MRPL47 MRPL35 HOXB2 RALA TIPIN GNG10 BRI3BP NOP14 RPS27L TMEM98 
RINT1 STOM SOCcS7 SASS6 GTF2E2 ITGAE Clorf109 XPO4 CEP55 FAM73B 
NME7 ANKRD33B © DOLPP1_=—s ORC4. MRPL46 WoR77 EP400 TMEM201 —-TEAD4 HHEX 
TSR1 CXorf40A NIPA2 ExO1 2223 ARMC6 PNP GLRX2 ACOT9 1QCcB1 
svip UNC119 KIF14 GPNMB —-HSPA9 TMEMSSB = CRADD QRICH1 MFHASL = MT1H 
M6PR RFC4 SPAGS CENPA C15orf61 SNRPA1 GABPB1 MCM6 C190rf48 MAP4K3, 
CSorf30 BCL2L12 KIFC1 LCLAT1 DSCC1 BRIX1 TOMM22 BUB1B EXOSC8 CRY1 
NSUNSP2 C10orf32 RAN ZNFS76 mMyYo19 KIAA0101 ORC6 NCAPD3. HOXC10 C9orf40 
CHEK1 PIGU DSN1 FGF13 ACYP1 OPHN1 LSG1 TAFS HYI AMOTL2 
SHROOM2 TCEAL8 MODH1 DARS2 CHST10 C2orf47 STRA13 RNF26 RNASEH2C EPHA2 
Downregulated in YRDC SETD8 WDR36 IPOS ADM TRMT11 UBE2C MRPL15 RNMTL1 GLRX5 
Normal-g vs. High-g RWDD3 GADD45A ANP32B RAD54L2 SERTAD2 ADO RFC3 TSNAX RRS1 
MTPAP. C16orf80 H2AFX PPID METRNL KIAA0947 ZNF330 Clorf112 MCM3 
MSTO1 CDCSL FOXD1 RNF149 RRP7A PPP6C FST CKS1B SERTAD1 
VARS2 DDXS DUSIL PDRG1 ccT3 LRRC5S8 DCTPP1 ANAPC4 ——CHKA 
EZR OSBPL3 KIAA1033 = C1lorf31 CENPN Clorf9 P2RXS CRNKL1 SUV39H1 
TFDP1 DUT TGS1 CTCF LIMA1 NSUN2 DDX46 MMGT1 ABL2 
MAD2L1BP cDCc4s UBE2T CCNB1 ETF1 EIF3B HOXB7 SRRM1 HIST1H2BK 
ASNSD1 UBR7 BTAF1 COIL FLRT3 CASP7 CDC26 TIPARP. POLA1 
ZNF783 CKAP2L PTPLB PCID2 ZNF721 ZC3HC1 spc1 EMP1 ERCC6L 
CcOx10 ARL4A PPAT POGK DHX33 SRXN1 RBM25 NOL11 POLE2 
RBM22 C200rf20 = TYW3 NUP5SO IRS2 ZWINT PAK1IP1 = URM1 ZNF274 
PNN NUBP1 DNAJB14 LIC TMEM97 PRIM1 CDCA3 BYSL CDCAS 
SMC2 SMNDC1 UTP20 SRRM2 WDR45L BAZ1A AGPAT9. PIGW ATADS 
MPHOSPH10 =MKI67 ADCY6 BTBD6 UG1 FLNB CCNT1 MSL1 WDR85 
ccbc101 ACBD3 SHQ1 DCLRE1A UQCRFS1 MYBL2 FANCI BLM C8orf73 
TIGDS NANP. CHERP. NAA4O POP7 CPSF7 POLR3K PFAS ATP6VOE2 
ERCCS SERPINB1  ZCCHC24  KDMSB PSIPL GLG1 LGALS1 NEU1 PSAP RASA1 PPAL 
PAN2 YPELS TNC PCYOX1 IMPACT NES TULP3 ETV4 DHX32 DYNLT1 UBAC2 
C3orf32 RABAC1 STX18 RALY NDUFA9 SPA17 SLK CXXC5 GTSF1 MAN2B1 DNASE2 
PRKCD BUD13 MAGED2 _PIGH CcDC104. —s BRMS1 PLDN SNX1 GPX4 ARAP1 SF3B14 
SH3BPS STOML2 FAM60A WDR45 HEBP1 ATRN SPSB2 SHISAS. CD97 NGFRAP1 
DALRD3 EMP3 CALHM2 —PKIG LDLRAP1 = AP3B1 SYTLS GDE1 PLA2G12A  RWDD4 
B4GALT2 MLKL APOA1BP —_CREBL2 STX16 BAX RDH11 PIM3 EIFSA MECR 
TLCD1 NRIP1 COL16A1 AKR1A1 GNAI2 CTSB PYCRL GOLGAS CCNG1 PFDN1 
GGPS1 PLOD1 TOR1B MRC2 COX17 NAPRT1 ZNFS12B PLP1 KDELR3 C17orf48 
MAML2 SPATA20 CYBA POLR2G UQCRQ BSG LHFPL2 UPRT IDE PFDNS 
TMEM181 TRIOBP BLMH $co2 TAF6 PREX1 LOC401397_ _POT1 AMFR POR 
SPESP1 C21orf33 G6PD LOC645212 SNX30 TMEM41B HAGHL RALGDS. LEPREL4 LGALS3. 
ELP4 RPS6KB2 KIAAO319L SLC26A2 TCF7L2 PACSIN3 FAM116A GTF2A1 SDHA CDKS 
Upregulated in GBAS HOXAS NFU1 FAF2 LGALS8 TSHZ1 RBL2 GLT25D1 GNAQ MFSD10 
Normal-g vs. High-g LOC147727 NTSE HAPLN1 ACADVL ZYG11B DCP1A HIBCH LAMB1 SH3GLB1 VvPS11 
SLC39A1 cEP192 cKB KCTD3 ARL2 PRKCI ATG4A SLC25A6—-TTC13 HINT2 
C10orf125 TPDS2L1 C6orf70 SLU7 DAP CAB39 IGF2BP3 EIF2A IFI30 VPS4A 
OCEL1 PAPSS2 $100A3 SCAMP1 NRAS CETN3 EAPP FAM108C1 = TTC32 ESYT1 
CYFIP1 PPP1CA MANSC1 SMYD3 ABR BCKDK 1L13RA2 ATPSA1 IGBP1 MLLS 
BBS2 MYH10 ATP6AP1 CLIP4 PMVK TBCE SCAND1 SLC44A1 UGGT1 MGAT4B 
HMGCL AMOTL1 PRKAR1A MYBBP1A MBTPS1 RB1CC1 CD70 TMEM192 FAM82A2 RAB11FIPS 
FZD7 SNHG8& HSD17B8 HTRA2 TWF2 SV2A ARL6IPS DTX3L PRPSAP2 KIAA1143 
OSTF1 SIDT2 DECR1 RFXANK Clorf85 TSPAN31 LOC152217_ LCMT1 ATG9A AP2B1 
C6orf120 KIAA1191 — SNX10 SLC25A46 CSorf54 EIF4B TTC19 AGGF1 ZMYND8 = SDHC 
FAM125A C4orf52 POLR3GL MTFMT ROBO1 BCKDHA NDRG3 DHX40 SRP14 MTAP 
FKBP2 SFRP1 ALKBH6, ARPC1B EMD. HADH VPS29 FKBP11 FAM86A TMEM14C 
coG4 FAM174A PTGFRN ZKSCAN1 MTR CBR3. NDUFBS UBE2L6 NECAP1 DHX16 
Mxi1 NFATC2 MPV17 KCNN4 PPP1R7 GABARAP. EDEM1 ACVR1 VPS72 EPS15 
FAM114A1 SH3BGRL3 PTPN14 RNF181 SERINC1 FADS3 CRELD2 LOC642361 AKR7A2 HOXB3, 
TCTN3 RIN1 C4orf48 DSG2 TSPAN14 COX7A2 RNF123 ALDH2 NTSDC1 BNIP3 
c 
Status Tumor Metformin Average Blood Glucose (mM) Status Tumor Metformin Average Blood Glucose (mM) 
TET2 ; 6.3040.59 TET2 - 12.9442.27 
er TET2 + 5.6540.73 TET2 + 14.1643.76 
Non-diabetic Diabetic 
Mock . 5.4040.84 Mock é 14.0843.49 
Mock + 5.60+0.69 Mock + 14.7642.90 


a, Univariate regression analysis showing presence of diabetes, BMI, age, systolic blood pressure, and HbA1c in association with 5hmC. These health parameters, particularly the presence of diabetes 
and HbA\Ic, are negatively correlated with ShmC. n= 28 healthy donors and 29 diabetic patients. Two-sided t-test. 

b, Glucose-modulated and TET2 dependent genes. Five hundred and eighty-five genes that are up- or downregulated in normal glucose compared to high glucose in A2058-TET2WT cells are listed. 
These genes had differential expression under normal glucose versus high glucose in A2058-TET2WT cells only, and not in A2058-TET2M or mock cells. These genes are defined as glucose-modulated 
and TET2-dependent. Expression profile of these genes is shown in Fig. 1h. 

c, Blood glucose levels in diabetic and non-diabetic mice treated with or without metformin. Blood glucose levels were measured on the last day of experiment. Metformin treatment did not alter blood 
glucose levels in either diabetic or non-diabetic mice. n=4—5, data shown as mean +s.d. 
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HOW TO TEACH AN OLD 
SEQUENCER NEW TRICKS 


Outdated DNA-sequencing machines need not die — researchers can repurpose them to 
drive next-generation biochemistry studies. 


BY JEFFREY M. PERKEL 


University in California, the guts of a dozen 

DNA sequencers lie exposed — hundreds 
of thousands of dollars worth of cameras and 
lasers, optics and fluid controllers, all scavenged 
from a late-model, next-generation Illumina 
DNA sequencer called GAIIx. On the floor, the 
shell of one old instrument sits empty, picked 
over like a carcass. “I seem like a hoarder,’ says 
Stanford biophysicist William Greenleaf. 

But over the past 6 years, this collection has 
fuelled an effort that has engaged about half 
of Greenleaf’s 18-member lab team. Whereas 
most researchers use DNA sequencers to, well, 
sequence DNA, Greenleaf’s team is one of a 
small number that has repurposed the devices 
for an entirely different goal: to study protein 
and nucleic-acid biochemistry on a massive 
scale, from macromolecular interactions and 


[I a basement storeroom at Stanford 


This next-generation biochemistry platform, dubbed ‘Serenity’, was assembled from the components of an old Illumina DNA sequencer. 


RNA folding to enzyme function. 

“Tt's a revolutionary technology,’ says Stan- 
ford biochemist Dan Herschlag, who uses 
it to study interactions between RNAs and 
other molecules. It provides “deep and broad 
quantitative information’, he says, “that allows 
researchers to build more-precise biophysical 
and cellular models for molecular interactions, 
and which is also a critical step towards a truly 
predictive understanding of biological systems” 

Broadly, the work demonstrates what’s pos- 
sible when scientists look into the guts of their 
hardware — proof that equipment isn’t nec- 
essarily without value just because it is old or 
outdated. 

But there’s a reason such technology 
development is called bleeding edge: things 
often go wrong. Sarah Denny, a biophysicist 
who graduated from Greenleaf’s lab this year, 
chuckles when asked whether her equipment 
offers ‘plug-and-play’ simplicity. “Many times 
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when you did an experiment, something 
would break and youd have to figure out how 
to get it to work again,’ she says. But given the 
volume of data she could extract, the reward 
was worth the pain. In Denny’s case, her team 
gained a better understanding of RNA folding. 
Such is life in the do-it-yourself trenches. 


BIOPHYSICS ON A CHIP 

When the GAIIx dropped in 2008, it was a 
hot commodity. Some sequencing centres 
had dozens of instruments, costing about 
US$600,000 apiece; in 1 week, a machine could 
push out 30 billion of the lettered bases that 
make up DNA. But by 2011, when Greenleaf 
established his lab at Stanford, the industry 
had migrated to faster hardware — such as the 
Illumina HiSeq 2000 — that was more effi- 
cient and user friendly, and people were giving 
their old machines away. “They’re basically big 
paperweights,” says Greenleaf. 
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> Illumina sequencers automate a sequenc- 
ing-by-synthesis process. A DNA library is 
randomly arrayed ona device called a flow cell, 
and amplified in place to create small clusters 
of about 1,000 molecules, each representing a 
single fragment of genetic information. The 
building blocks of DNA, or nucleotides, are 
then transferred onto a chip, each containing 
a unique fluorescent signature and a reversible 
chemical modification. This process ensures 
that only a single nucleotide can be added at 
each cluster. The sequencer then images the 
array, and on the basis of the colour at each 
position, ‘calls; or reads, the base that was added. 
The modification is then removed, and the pro- 
cess repeats, allowing the entire sequence to be 
identified base by base. 


THE BASICS OF HACKING 

At their core, these instruments are high-end 
microscopes coupled with liquid handlers that 
help to move reagents around. Some of their 
components — particularly cameras, lasers 
and movable stages — can cost tens of thou- 
sands of dollars. In 2009, Christopher Burge, 
an RNA biologist at the Massachusetts Institute 
of Technology in Cambridge, realized that it 
might be possible to repurpose that hardware 
to do something else. 

“The GAI/ is basically a pumping system 
that pumps things onto the flow cell and then 
onto a fancy imaging system,’ Burge says. “We 
realized that, well, maybe you could pump 
other things onto the flow cell.” 

That, he says, is because the GAIIx was an 
‘open system, controlled using editable con- 
figuration files called recipes and loaded up 
with reagents that could be changed simply 
by substituting one tube for another. On the 
inside, the machine was equally open, with 
off-the-shelf, third-party components held 
together with cable ties. “In retrospect, it looks 
like a high-school science project,” says Gary 
Schroth, a biochemist who directs the genom- 
ics-application group at Illumina in San Diego, 
California, and who collaborated with Burge 
on his early studies. 

New Illumina instruments, by contrast, are 
more polished, with custom hardware, hard- 
wired control software and barcoded reagents 
— features that improve the user experience but 
preclude hacking. 

Jacob Tome helped to hack a GAIIx as a grad- 
uate student in the lab of molecular biologist 
John Lis at Cornell University in Ithaca, New 
York. He recalls “a lot of panicked phone calls” 
with Illumina representatives, trying to work 
out the intricacies of the company’s instrument- 
control software. “There’s no manual to repro- 
gram the instrument,’ he says, so working out 
the logic was largely trial and error. 

“One of the most exciting weeks of my 
PhD was when we just kind of sat there at the 
sequencer, and wed write a recipe and then 
watch the temperature go up and watch what 
solution it was pumping,” Tome says. 

Burge, however, had help. Working with 
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Schroth, his team modified the GAIIx recipes 
to accept new reagents, introduce pauses, adjust 
the running temperature and alter the imaging 
parameters. The researchers then applied that 
modified system to conduct a comprehensive 
study’ of the sequencing preferences of a DNA- 
binding protein in yeast called Gcn4p. 

“We showed for the first time that you could 
turn a sequencer into a biophysical instrument 
to measure protein-DNA interactions at very 
high throughput,’ Burge says. 

To do that, Burge’s team treated the flow cell 
— which is usually discarded after sequenc- 
ing — to regenerate double-stranded DNA at 
each cluster. They then introduced fluorescently 
tagged Gcn4p protein to the chip at progres- 
sively higher concentrations, and used modi- 
fied software to trick the sequencer into imaging 
the flow cell as if it had just completed another 
round of sequencing chemistry. By download- 
ing and processing the final images, rather than 
the base calls the instrument typically outputs, 
the team could measure how much Gcn4p was 
bound at each place. From that, the group could 
deduce its affinity for every sequence on the 
chip — some 440 million measurements in all. 
“That was a really beautiful, elegant experi- 
ment,” says Greenleaf. It showed that the 
GAIIx platform had the potential to provide 
“all the sorts of measurements I was excited 
about: kinetic measurements, on-rates and 
off-rates and equilibrium constants, all the 
physical measurements that one might need 
to understand from first principles how DNA- 
protein interactions work” 

Several researchers have used similar hacks 
to address questions of their own. At the Mas- 
sachusetts Institute of Technology, for instance, 
RNA biologist David Bartel looked at messenger 
RNA (mRNA), the RNA copy ofa gene that is 
used to produce proteins. His team found a link” 
between the number 


of ‘A bases strung at “When it 
theendofanmRNA «actually works, 
molecule—a feature boy, it really 
known asapolyade- cooks. Nobody 
nylated tail— and can generate 


how efficiently pro- datalike that.” 
teins are produced 
from genes in animal development. 

Ilya Finkelstein, a biophysicist at the Uni- 
versity of Texas at Austin, hacked a sequencer 
flow cell to investigate* the DNA-binding 
preferences of a protein called Cascade, used 
in one version of the gene-editing technique 
CRISPR-Cas. And some members of Green- 
leaf’s team worked with CRISPR pioneer 
Jennifer Doudna at the University of Califor- 
nia, Berkeley, to study* why Cas9 — the most 
commonly used enzyme in CRISPR — some- 
times cuts at the wrong place in the DNA. 

Researchers have also worked out methods 
for transcribing on-chip DNA to create ultra- 
high-throughput RNA arrays — an approach 
that could be used to study RNA-protein inter- 
actions or to screen libraries of folded RNA 
molecules called aptamers. 
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The challenge of making such an array is cap- 
turing the transcribed RNA. Lis and Greenleaf 
independently introduced*” a physical block at 
the end of double-stranded DNA molecules, 
stalling RNA polymerase — the enzyme that 
creates the mRNA molecule — with its dangling 
transcript. At Weill Cornell Medical College in 
New York City, RNA biologist Samie Jaffrey’s 
team — led by chemical biologist Nina Svensen, 
then a postdoc in the lab — used‘a special, viral 
RNA polymerase to chemically link the newly 
synthesized RNA to the chip, and then added a 
nuclease to degrade the template DNA. 


HACKERS AGAINST DISEASE 

For Stanford biophysicist Rhiju Das, such RNA 
arrays have served as platforms for identifying 
molecules that can diagnose active tuberculosis. 
The project began when a colleague discovered’ 
a signature of three RNAs that can distinguish 
between dormant and active tuberculosis. The 
challenge was to pinpoint a single molecule — a 
‘riboswitch — that could actually identify which 
state the disease was in. Such a diagnostic mol- 
ecule would have to be able to bind all three 
RNAs, plus a fluorescent reporter, and change 
shape depending on which ones were present. 
“Tt’s totally bananas, obviously,’ Das says. 

To solve the puzzle, Das recruited the user- 
base of an online game he co-developed, 
called eterna. Players are tasked with design- 
ing RNAs that can fold in certain ways; the 
most promising structures get synthesized 
as part of a DNA-sequencing library, 10,000 
at a time, and tested on the GAIIx chip. With 
10,000 measurements per design, that’s about 
100 million data points from each experi- 
ment — data that are fed back to the players to 
hone their skills. Over time, some users have 
become quite adept, Das says. “These players 
have now discovered principles that let them 
get perfect riboswitches, thermodynamically 
optimal riboswitches for a variety of test cases.” 

And that includes the OpenTB Challenge. 
Over 3 rounds of game play, Das’s team has 
tested nearly 27,000 riboswitches from 187 
players , identifying several that seem to fit the 
bill. One, submitted by a retired engineer in 
Falmouth, Massachusetts, looks like the floor 
plan ofa castle in one state, and like an anchor 
in the other. Now, with funding from the Bill 
& Melinda Gates Foundation, Das plans to 
begin testing some of these designs for use as a 
pregnancy-test-like diagnostic for tuberculosis. 

Meanwhile, a few scientists have taken the 
next logical step, translating on-chip RNAs into 
protein. In 2016, Svensen showed’ she could 
synthesize short protein sequences called pep- 
tides on the array, by exploiting the antibiotic 
puromycin to tether the peptide to the surface. 
And in June this year, Greenleaf-lab postdoc 
Curtis Layton reported® developing an assay 
called Protein display on a Massively Parallel 
Array to profile 156,140 variants of an enzyme 
called SNAP-tag. He wanted to learn the subtle 
rules that relate amino-acid sequences to pro- 
tein function. “It’s been so exciting to see just a 
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Fluorescence imaging of a sequencing flow cell reveals binding interactions at specific clusters. 


whole new scale of protein functional analysis 
that we're able to do on these machines,” he says. 

Exciting, but not easy: Layton spent four or 
five years developing his method and hard- 
ware. Svensen spent two years developing 
her method, which Jaffrey’s lab is now apply- 
ing to create a whole-human-proteome chip. 
“To buy a sequencing kit for the GAII back 
then was $5,000. You can't really make many 
rounds of optimization if you have to spend 
that for every experiment you do,’ she says. As 
aresult, Svensen, now at the Wellcome Centre 
for Anti-Infectives Research at the University 
of Dundee, UK, did much of her testing on 
microscope slides, or by using discarded flow 
cells from her department's core facility. 


A SPACE ODYSSEY 

Most GAIIx hackers use the machine largely as 
is, tweaking the instrument software to make 
it do what they want. According to Jaffrey, that 
makes it accessible to anyone with an old GAIIx 
lying around. “We don't want something only an 
engineer can use,’ he says. “Our protocols can 
be used by any researcher with that machine” 

And for its initial RNA work on the GAIIx, 
the Greenleaf lab was no different. Donated by 
Stanford geneticist Patrick Brown, the system 
used in that work, Layton says, was named in 
honour of the Star Wars spacecraft Millennium 
Falcon — specifically, pilot Han Solo’ plea to 
the ship: “You hear me, baby? Hold together!” 

“It’s this old machine that we're still trying 
to run for something it wasn't intended to do,” 
Layton says. “Just don't give us any errors, don't 
give us any problems. But, when it actually 
works, boy, it really cooks. Nobody can gener- 
ate data like that.’ 

Ultimately, the team decided the approach 
was unsustainable. For one thing, says Layton, 
unlike modern sequencers with their push-but- 
ton interfaces, the GAIIx was less Millennium 
Falcon and more “like flying an Apollo space- 
craft”. Newer sequencers are also faster, and pro- 
duce longer reads. Plus, Illumina announced in 
2015 that it would be ‘end-of-life-ing’ the GAIIx, 
meaning that flow cells, reagents and parts 


would no longer be available as of July 2017. 

The team extracted the system's optics, stage, 
camera, pumping system and lasers; supple- 
mented the sequencer’s built-in illumination 
system — which lights up the sample at an 
acute angle from below — with downward- 
facing widefield illumination; designed a new 
printed circuit board to operate the lasers; 
cobbled an autosampler out of GAIIx parts to 
automate reagent addition; and added an on- 
board temperature controller. 

Most significantly, the team designed new 
sample holders, so the GAIIx imaging system 
could accommodate Illumina’s new, smaller 
MiSegq chips. “That gives us the ability to 
decouple our biophysical measurements of flu- 
orescence and the sequencing,” Greenleaf says. 
(Finkelstein also runs his assays on MiSeq chips, 
imaging them on a conventional microscope.) 

The result is a device to warm an engineer's 
heart: a compact black cube adorned with what 
looks like a ripped-open desktop computer on 
one face. The hardest part, says Layton, was cod- 
ing the software required to control the hard- 
ware, and working out which signals would 
drive what action. “That was a Herculean effort?’ 
he says. But overall, the project appealed to his 
hobbyist interests in engineering and software 
design, not to mention biochemistry. Winston 
Becker, a dual PhD and medical student in 
the lab, with a master’s degree in engineering 
mechanics, was drawn to the project for similar 
reasons. “I wanted to do something very physi- 
cal, very quantitative. And obviously an oppor- 
tunity to use instrumentation skills and build 
things was really exciting, too” 

But wouldnt it have been easier to just buy a 
microscope instead? According to Greenleaf, 
that wasn't really an option — or at least, not 
an affordable one. “There's really nothing out 
there,” he says, “nothing that will do the sorts 
of things that we want to do.” A custom instru- 
ment, he estimates, would probably have cost 
hundreds of thousands of dollars. 

Das, who built his own instrument off 
Greenleaf’s design (and with Greenleaf-lab 
assistance), says the experience was “so fun”. 
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But with no how-to manual, “you have to be 
willing to go bug people’, he says. Still, vexing 
issues can arise. Das’s instrument wouldn't 
focus properly, for instance. After two weeks 
of troubleshooting, Das realized he was miss- 
ing a lens. “It was so dumb,” he says. (He has 
since compiled a 64-page instruction man- 
ual. Another researcher, Nick Kaplinsky at 
Swarthmore College in Pennsylvania, has 
published detailed blog commentary (see 
go.nature.com/214fdme) describing his work 
turning a GAIIx into ‘RootScope; a microscope 
for imaging plant roots.) 

Sticking with the tradition of naming instru- 
ments after fictional starships, Das christened 
his instrument Red Dwarf, after the British 
science-fiction series; this team then added a 
sister ship, Nostromo (Alien). Other Greenleaf 
imagers include Heart of Gold (Hitchhiker's 
Guide to the Galaxy), Borg Cube (Star Trek: 
The Next Generation) and Serenity (Firefly). 
“If you've built an instrument, you will get 
the glory of the name,” Greenleaf says. His 
lab actually has a dozen sequencers in various 
states of disassembly throughout the building, 
most of them acquired for the cost of shipping. 

If things go according to plan, that fleet 
will become ever more integral to Greenleaf’s 
work. His lab is perhaps most famous for 
developing ATAC-seq’, a method for assess- 
ing how accessible chromatin is to the pro- 
teins that bind it, and half the lab is dedicated 
to that work. Now, Greenleaf hopes to merge 
the teams, using his hacked sequencers to build 
mathematical models that accurately reflect 
macromolecular interactions in the cell itself. 

The question is, will other researchers fol- 
low suit? Schroth says that between the rea- 
gents, the hardware and the analysis tools, 
these assays are so customized that they could 
prove difficult to reproduce in other labs, at 
least without a knowledgeable insider to help. 

But there's an easy fix, hackers say: open up 
modern platforms to exploration. “Then the 
scientific community could push the instru- 
ments in new directions that could ben- 
efit everybody,” says Burge. Finkelstein says: 
“There's still a lot of space to play with these 
sequencers off-label, if you will” m 


Jeffrey M. Perkel is technology editor for 
Nature. 
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Water supplies are scarce in parts of the Democratic Republic of the Congo because of poor infrastructure. 


Science and strife 


Five researchers explain how they run productive labs in war-torn or resource-poor regions. 


well-equipped laboratory stocked with 
Awe and supplied with uninter- 

rupted electricity and unlimited 
water might seem like a basic requirement for 
conducting research. But scientists who work 
in regions that have limited resources or that 
are riven by conflict cannot take such ameni- 
ties for granted. They must perpetually seek 
scarce grants, publish their own journals, form 
their own scientific societies and — crucially 
— draw on their deep reserves of resilience. 
Nature asked five such researchers how they 
run productive labs in the face of electricity 
shortages, border-checkpoint closures, poor 
Internet connections and other challenges. 


Engage with 
stakeholders 


Forestry researcher, University of the 
Philippines, Los Barios 


For the past 13 years, I have been profiling the 
contamination of the Marilao, Meycauayan 
and Obando River System (MMORS), which 
was on the ‘Dirty 30’ list of the most polluted 
places in the world in 2007, according to the 
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non-profit organization Pure Earth. There are 
many polluting industries upstream, including 
the largest lead smelter in the Philippines, gold 
smelters, jewellery workshops and tanneries. 
Downstream are fish farms. We found elevated 
levels of heavy metals in the water, in the sedi- 
ments and in fish, especially shellfish, which 
are sold in the local markets (M. E. T. Mendoza 
et al. J. Nat. Stud. 11, 1-18; 2012). At least 
100,000 people in the municipalities of Marilao, 
Meycauayan and Obando, and in the metropol- 
itan Manila area, are eating contaminated fish. 
There are no toxicologists in the area who can 
accurately diagnose illnesses connected with 
heavy-metal ingestion. So when we looked 
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> at medical records, there were no entries 
for heavy-metal poisoning. If we cannot prove 
that these metals are causing harm to people, 
it’s very difficult to convince policymakers and 
local executives to take action. We have no local 
laboratories that can analyse heavy metals found 
in fish, or in water or blood samples. 

Local officials, the governor and some of the 
mayors were really antagonistic because the 
fishing industry is a major source of income 
for these municipalities. I have been very care- 
ful, even from the outset, to always update the 
mayors on our projects, and I am accompanied 
by local and regional government representa- 
tives whenever I do my monitoring activities. I 
do nothing without their consent and am very 
transparent in my work. 

One of my strategies was to build a network 
of stakeholders — including national agencies 
such as the Bureau of Fisheries and Aquatic 
Resources and the Department of Environ- 
ment and Natural Resources — that share my 
concerns. [also built a good rapport with peo- 
ple who live in the region. There are several 
associations for fishers and leather-makers 
in these areas, and we work with them and 
include them in consultations and meetings 
about water-quality management. Our project 
helped to have the area declared as a legally 
designated water-quality management area. 
That’s why we're able to continue our work. 

We used funding from Pure Earth to do 
regular longitudinal sampling in sections of 
the river system, including of sediment, water, 
fish and other aquatic life. There’s a problem 
collecting data and samples, because it is costly 
and the national and local governments have 
limited funds. There is also no single reposi- 
tory of data with which monitoring can be 
more effectively planned and analysed. 

Our monitoring results were included in 
a Pure Earth database that was shared with 
other stakeholders, including regional envi- 
ronmental-management offices and local 
government units. In turn, this encouraged 
those agencies to conduct studies to comple- 
ment our work and to share their data. So I 
was able to get money from the Asian Develop- 
ment Bank, Green Cross Switzerland and the 
Hong Kong Shanghai Banking Corporation, 
as well as a small amount from the Coca-Cola 
Company, to conduct environmental monitor- 
ing — including assessment of heavy metals in 
selected aquatic organisms. 


EMMANUEL I. UNUABONAH 
Use available 
resources 


Materials chemist, Redeemer’s 
University, Osun State, Nigeria 


Potable water is a challenge for us here in 
Africa and across the world: around 1.8 billion 
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people worldwide get their drinking water 
from a source that is polluted with faeces. As 
part of our work, we are developing hybrid clay 
composites to adsorb enteric bacteria, such as 
Escherichia coli, Salmonella species and Vibrio 
cholerae, from water. We also use composites 
made from readily available materials such as 
kaolinite clay, papaya seeds and plantain peels 
to extract heavy metals from water. 

We are not funded by the government. On 
average, for close to 100 days a year, we have 
no electricity. We have an alternative utility 
on campus, so when the power goes off at the 
national grid during work hours, the generator 
comes on. If we get lucky with timings, we are 
guaranteed 36 hours of uninterrupted power 
to run experiments. But when the generator 
isn’t running and the grid power has gone off, 
we just have to wait. 

Sometimes I use my salary to fund my 
research and to keep our students. Then I have 
to struggle to write international grants. I’m so 
grateful to The World Academy of Sciences in 
Trieste, Italy; the last grant it gave in 2014 (for 
US$63,230) took care of stipends, school fees 
and research expenses for the students, and we 
used part of it to buy equipment. A colleague 
at the University of Edinburgh, UK, sent us a 
$600 bacteria-testing kit last year, but we can't 
use it now because a related microscope part 
was damaged by a power surge. 

We have a lot of wonderful ideas, wonder- 
ful theses just hanging about the shelves, but 
nobody’s utilizing them. Some young scientists 
developed cheap electrical power systems from 
electronic waste materials, but they don’t have 
the money to develop them further. Nigeria 
has a thriving oil industry, but the govern- 
ment’s Petroleum Trust Development Fund 
uses oil-industry proceeds mostly to fund 
scholarships for Nigerian students abroad, and 
spends very little on scientific research. 


KALULU MUZELE TABA 
Aim for the 
possible 


Organic chemist, University of Kinshasa, 
Democratic Republic of the Congo 


Our research seeks to solve problems that 
have societal impacts, such as malaria, which 
is endemic in Kinshasa, the capital of the Dem- 
ocratic Republic of the Congo (DRC). People 
in the poorest areas of Kinshasa are growing 
about 55 different plants, including citronella 
and papaya, to try to treat the symptoms of 
the disease. We thought, why not investigate 
these plants? We tested eight of the most-used 
plants and showed that extracts and metabo- 
lites had considerable antimalarial activities. 
We have a small booklet in French and in the 
local language, Lingala, that we send to peo- 
ple to explain how to use these plants more 
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effectively. We are also studying plant extracts 
that can be used to treat antibiotic-resistant 
Mycobacterium tuberculosis (K. B. M. Jose et al. 
Med. Clin. Rev. 4, 5; 2018), using a grant from 
The World Academy of Sciences. 

We don't get money from the state. In our 
lab, we don’t have equipment. We don’t have 
money to buy solvents. Water is available 
between 5 a.m. and 7 a.m., so we have a con- 

tainer that collects 


“To scientists water at night, and 
working in during the day we 
comparable have a pump. For 
circumstances, _ electricity it’s much 
Iwouldsaythat harder. In the middle 
although it’s of the day, it can come 


and go many times, 
and you hope it won't 
go while you're work- 
ing. We used to have a small generator as a 
backup for computers, but it’s broken. 

We don’t fold our hands and cry and say that 
things will get better. We do whatever we can. 
Most of the time we buy our own reagents and 
solvents with our salaries. We try to motivate 
our master’s and PhD students by finding a 
way to collaborate with the outside, writing to 
foreign labs to see whether our students can get 
overseas fellowships. One student, Joséphine 
Ntumba, went abroad three times, to the 
Catholic University of Louvain in Louvain-la- 
Neuve, Belgium. She has completed her PhD 
and teaches at the University of Kinshasa. I 
did my PhD at Northwestern University in 
Evanston, Illinois, and then went to the Max 
Planck Institute for Coal Research in Miilheim 
an der Ruhr, Germany. It was hard to come 
back. It was not only material, but mental too. 
I knew that some things would be impossible, 
but I feel that I have to contribute and inspire 
young people in science. 

For the past five or six years, I have been the 
editor-in-chief of the journal Congo Sciences, 
which I co-founded. We started it because we 
wanted to bring visibility to research done 
in the country. The journal was financed for 
some time by the Academy of Research and 
Higher Education (ARES) in Brussels. The 
academy has stopped funding it now, but we 
are still publishing the journal. 

For the past ten years, I’ve been trying to 
create an academy of sciences for the DRC, 
similar to the American Association for the 
Advancement of Science in Washington DC. 
We have to try to get scientists together and 
to speak as one voice, and then perhaps the 
state can start understanding that financing 
research at the university is important. These 
are some of the things that make me feel happy 
that I came back home. Maybe I lost a lot as 
a scientist, but as a Congolese, I hope Ican 
do something for my place, and for the world. 

To scientists working in comparable cir- 
cumstances, I would say that although it’s 
hard, it’s not impossible. Know that you 
should find maybe not the best solution, but 
the least-worst one. 


hard, it’s not 
impossible.” 


MOHAMMED SAFIA 


=I 


GGateway students on the training scheme in information technology funded by the Basque government. 


RASHA ABU-SAFIEH 
Choose the 
positive 


Computer engineer and co-founder of 
GGateway,Gaza Strip 


I co-founded GGateway, a social-enterprise 
company in the Gaza Strip that provides 
outsourcing services for information and 
communications technology (ICT) around the 
world. We offer training and employment to 
recent university graduates in Gaza with ICT 
degrees. Our main goal with GGateway is to 
help people to have a source of income. With 
the shortage of clean water, the polluted sea 
and the blockade imposed by Israel and Egypt 
since 2007, living conditions here are dire. 

We came up with the idea of GGateway in 
2012. In November 2013 we launched a pilot, 
and in February 2015 we got the green light 
from the Korea International Cooperation 
Agency for a $1.3-million grant to fund our 
plan. That was one of the biggest, happiest 
things that ever happened to us. 

We were running a pilot project for the 
United Nations Relief and Works Agency 
for Palestine Refugees in the Near East 
(UNRWA), but in July and August 2014 we 
had the 50 days of conflict with Israel. That 
was difficult: the bombing went on all day, 
there was no electricity and we had limited 
access to water and food. There was no safe 
zone, no safe area. So we had to stop our first 
project. 


Two days after a ceasefire ended the conflict, 
we wrote up another eight new concepts for 
different projects, and the UNRWA agreed to 
operate three. We could either be positive or 
cry all day. We chose the positive side and to 
move on. 

The cables that we use for all our techni- 
cal networks are on the list of items that Israel 
does not allow to enter Gaza. With the sup- 
port of the UNRWA, it took us nearly four 
months to bring them in from Israel. Without 
the UNRWA, it would have taken us a year or 
more. We also use a generator because most 
days we get electricity for two to four hours. 

The overall unemployment rate among 
graduates in Gaza with ICT degrees is 70%. 
Among female ICT graduates, it’s 92%. We 
applied for grants that focus on vulnerable 
women, and got one from the Basque govern- 
ment in Spain to train 60 female graduates for 
jobs. We also got a grant from the US-Middle 
East Partnership Initiative to empower and 
train 300 ICT graduates to become profes- 
sional freelancers, and won a $3-million grant 
in June from the World Bank to train students 
and software engineers. We have contracts 
with UN Women and UN Habitat. 

We are able to travel outside Gaza only 
two to three times a year, maximum, and 
sometimes not at all; we need to apply for an 
exit permit from Israel with support of the 
UNRWA. Sometimes we are accepted, some- 
times rejected; no reason is given. We often 
lose opportunities if we are registered for a 
conference, for example, or for training. 

When we see what we are doing — that 
it’s changing people's lives, despite all of the 
difficulties around us — it makes us feel good. 
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ELIZABETH TILLEY 
Focus on small but 
crucial changes 


Sanitation economics researcher, 
University of Malawi, The Polytechnic, 
Blantyre 


I came to Malawi in 2015 after 9 years as a 
project officer and PhD student at the Swiss 
Federal Institute of Aquatic Science and 
Technology in Diibendorf. [had worked on san- 
itation projects in Nepal, South Africa, Tanza- 
nia and Nicaragua. Most of my workin Malawi 
now is teaching and supervising master’s and 
PhD students on such projects as making fuel 
briquettes out of dried faecal sludge. We work 
on ‘shit-flow diagrams’ — trying to map and 
understand where excreta is being generated 
and how much of it is being treated. 

We have a very bad Internet connection, 
and it’s a barrier to downloading files or 
making Skype calls. We don't have subscrip- 
tions to journals. We have 30 computers for 
4,000 students. Paper and photocopying are 
very expensive. We go days without water to 
even flush the toilets. At the university, we 
don't have toilet paper, so I bring my own each 
day and I keep a secret bottle of soap. 

The research agenda in Malawi is driven by 
big donors from the global north, including 
national governments such as Norway and 
Japan, and private donors and non-govern- 
mental organizations. Very little funding goes 
to African researchers for work on topics that 
they've identified themselves. The fact that 
northern countries offer funding opportuni- 
ties to those in the global south is an excellent 
form of development. 

But some proposals call for the participa- 
tion of a southern partner with no require- 
ments for the division of funding. This means 
that the southern partner is sometimes given 
a limited budget for limited work that has 
limited impact. 

I would encourage northern researchers to 
think about doing sabbaticals in African univer- 
sities. It gives the southern researcher a chance 
to focus on publications or research, to be 
exposed to new ideas and methods, and to con- 
nect with a broader network, and the northern 
researcher can learn how things operate in the 
south and appreciate what works well at home. 

When you start to think about how crushing 
the whole system is, you can go crazy. [hada 
student who just wrote to tell me that he got 
into a master’s programme in the United States, 
and to thank me for the reference letter. He's so 
excited, and that’s the kind of thing I can hold 
on to for a couple of months. m 


INTERVIEWS BY JOSIE GLAUSIUSZ 
These interviews have been edited for clarity and 
length. 
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Ua SCIENCE FICTION 


BY GRIFFIN AYAZ TYREE 


y body was at the dining table 
where Id left it. 
Postured differently now: bolt 


upright, one hand clenched around the 
stem of a wine glass with the fingers 
going numb — it was a wonder I hadn't 
snapped the thing in half. 

It hadn't been this tense before. 

Nick squirmed in his seat, started to 
say something but then thought better 
of it. He was a paying client; no need to 
explain the aborted session. I pushed 
back my chair and stood. 

“Can... can you tell her I’m sorry?” 
he stammered. 

I felt it then, the knot in my stomach, 
the lingering heat on my cheeks where 
Id flushed in anger. 

“Argument?” I asked. 

Anod. 

“It’s common. There’sa lot to rehash 
the first time.” 

I ejected his wife's loop-drive from 
my cranial rig and laid it on the table. 

“Same time next week?” His voice was 
hopeful and a little strained. 

I smiled. Of course. How else would I pay 
the bills? 


On the train home, a sudden craving for 
noodle soup. Shoyu broth, sliced pork, soft- 
boiled egg — nothing I'd ever liked. 

Is that you, Lana? 

Only the steady hum of the train in reply. 
I leant my head against the window and 
dozed off while rain battered the plexiglass. 
When I came to, the weather had cleared and 
Lana was ready to talk. 

What am I? 

She knew the answer to that; everyone 
did. Neural connections from a dying brain, 
stored in binary and back-translated into 
another body. Next question. 

Why am I here? 

Because running someone’s persona 
through a cranial rig leaves traces behind 
— electric patterns worn like footprints 
in snow. 

Huh. 

She was taking it well, relatively. Most 
would have filled my blood with stress hor- 
mones by now. 

My stomach 


> NATURE.COM growled and my 
Follow Futures: thoughts — our 
W @NatureFutures thoughts, I guess — 
Ei go.naturecom/mtoodm drifted back to food. 
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al 


SURROGATE 


Total recall. 


Lana loved ramen but Nick was never in the 
mood, wouldn't even humour her. When she 
got sick hed bring it home everyday, in big 
Styrofoam containers, but by then the chemo 
had her doubled-over and throwing up most 
of the time. 


So what happens now? 

Right. The question I'd hoped to put off 
as long as I could. At least I could misdirect. 

Now we eat. 


Tyler never asked existential questions. 
He was six when he died; freak accident, 
drowned in a pool. Never wondered what it 
meant to be ‘reset’ with the cranial rig. Too 
young to know that was standard procedure. 

He was a brat, though: he'd throw a fit 
every time he saw a toy (dinosaurs were 
his favourite) and he wouldn't let up unless 
I bought it for him. There was a toy store 
across from the ramen bar; I tried not to look 
at it for too long. 

That’ classic Nick, Lana said between big 
gulps of savoury broth, as if it was her doing 
the eating and not me. I’m not even cold in 
the ground and he’s out looking for someone 
younger. 

Tyler’s parents would hire me for six hours 
at a time, whole days spent at the park or the 
zoo or the state fair. When the session was 
over I'd find myself holding a new action 
figure or oversized soft toy — the kid was 
true to form. 

He was a brat, like I said. He was tiring. 

And once the parents realized they had 
a choice — when to meet, whether to meet 
— they called less often. Eventually, they 
stopped altogether. 

As gently, as inconspicuously as I could, 
I slid my left hand up my neck to feel for 
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the small oval button on the side of the 
cranial rig... 

He expects me to be fine. Like we just 
‘separated’. Like I’m not even dead. 

It was the perfect arrangement for the 
bereaved, really: to them, their son was 
still alive on a loop drive somewhere 
— nothing lost, nothing to mourn. 
And they could see him again when- 
ever they wanted. 

To the other party, though, it was... 
Lana put it best: The worst parts of 
being alive, none of the perks of being 
dead. It sucks. 

I was sweating then — I think it was 
the soup, the heat. I don’t think it 
was nerves. 

Then why were my hands shaking? 

Look, whatever you have to do, just 
do it quickly. 

It always came to this: finger over the 
button, poised to erase an afterthought 
of someone else’s refusal to grieve. 

It always came to this, and I always 
made the same choice. 


We bought a housekeeping unit when I was 
a kid, one of those low-end models with a 
hacked-together AI. It had a real problem 
with steps; it would fall to the side and give 
offa strange synthetic groan. 

Naturally I sent it up and down stairs 
every chance I got. 

My mother would chide me for it: “Dont 
be cruel, child” 

But it isn’t real, I'd say, it wouldn’t even 
matter if I knocked it over. 

“Tt would matter a great deal to you. To the 
person you are.” 

Mom died without a loop drive. Gone and 
lost forever, some would say. But there are 
parts of her I'll carry to my grave. 


After dinner I emerged into the cold air 
and made a bee-line for the toy shop. Lana 
helped me pick out a stuffed dinosaur from 
the display, a little blue Stegosaurus. 

He'll love it. 

Yeah, if he hasrtt already seen. 

It was a ten-minute walk to the train sta- 
tion through empty streets. I used to hate 
that kind of thing, the solitude, the fear — 
but I haven't felt alone ina very long time. = 


Griffin Ayaz Tyree is a psychiatry-bound 
medical student. He lives in San Diego with 
his partner, and several small birds who 
leave droppings on their patio. He tweets 

as @ayazyouyaz. 
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